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Chapter 1 

Introduction 



Quantum Chromodynamics ( QCD) describes the interactions of quarks and gluons, 
particles which carry a charge cahed color. For small momentum transfer, quantum 
fluctuations of interactions between color charged particles become so important that 
they cannot be treated any more as small corrections. In fact, the interactions become so 
strong that on larger scales (> Ifm), quarks and gluons are never observed individually, 
but are confined in hadrons, composite particles which have no net color. Nucleons 
(neutrons and protons), the constituents of atomic nuclei, belong to the class of hadrons. 
Analyzing their inner structure in terms of QCD with the usual method of perturbation 
theory fails due to the strong interactions [TWDlj . A different approach is needed. In 
the long run, it is vital to describe hadrons quantitatively using Standard Model QCD. It 
helps us to gain accuracy in the interpretation of low- and high-energy experiments, adds 
to our understanding of nuclear physics and plays a role in the attempt to go beyond the 
Standard Model. 

One idea is to set up an Effective Field Theory {EFT), like Chiral Perturbation 
Theory {xPT). Instead of quarks and gluons, the theory is formulated in terms of the 
composite particles [effective degrees of freedom) that are thought to be physically 
relevant in the regime of interest, e.g., pions and nucleons. A systematic approach guar- 
antees that the symmetry features of QCD are preserved in xPT. A disadvantage is the 
infinite number of interaction terms and coupling parameters [low energy constants, or 
LECs) that enter xPT. Among other things, they encode the form factors, i.e. the shape 
and size of the composite particles. In practice, one works with an expansion in small pa- 
rameters. Then, only a finite number of interaction terms yield significant contributions. 
The numerical values of the coupling parameters are a priori unknown. Usually they are 
determined by fitting the parametric expressions of a set of observables to empirical data. 
Once they are known, predictions for observables inaccessible to experiment are feasible. 
However, the necessity to make use of empirical data conflicts with the wish to calculate 
hadron properties ab initio, i.e., from Standard Model parameters only. 

This is the point where numerical Lattice Field Theory calculations become useful. 
These calculations evaluate the dynamics of quark and gluon fields numerically in a four 
dimensional cube of space-time. Continuous fields are approximated by the field values 
specified at lattice points inside the cube. This is a good approximation as long as the cube 
is larger than the hadrons and the lattice is finer than structural sizes within hadrons. Thus 
by putting the fields on the lattice, we have a method that permits ab-initio calculations of 
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masses, form factors and other properties of hadrons. Since calculations within Quantum 
Field Theory require in principle the consideration of all possible field configurations, 
lattice calculations become numerically very expensive. In order to obtain results on 
modern computers, the up- and down-quark masses are set to much higher values than 
they are in physical reality. Simulations at the true quark masses still seem to be a long 
way off. Thus one needs some kind of extrapolation to lower quark masses in order to 
extract physical quantities. 

The obvious means for extrapolations of lattice QCD results is chiral perturbation theory, 
which offers quark mass dependent formulas for observables in the low energy domain. 

The prime examples of quantities that can be calculated using lattice QCD are the pion 
mass and the nucleon mass.^ Experiments yield for the pion mass m^'^'' and the nucleon 
mass rri^^^ realized in nature 

^Phys ^ Q_i4 , mPl'y' « 0.94 GeV 

With highly accurate experimental results available, why does one engage in complicated 
calculations for these two observables? Firstly, they can show us how well lattice extra- 
polations really work and what obstacles to look out for. Secondly, it is possible to extract 
information from the quark mass dependence that is difficult or impossible to measure in 
experiments. Last but not least, it would be a brilliant scientific feat to demonstrate that 
the nucleon mass really follows from the Standard Model. 

Lattice data with quark masses low enough to attempt extrapolation using xPT have 
become available only recently and are still scarce. Promising results have been published 
in reference |PHW04] . showing a successful fit to nucleon mass data from the lattice. Not 
all information entering this fit comes from lattice calculations. Some of the LECs have 
been fixed to values obtained from empirical data. Additionally, the authors included the 
nucleon mass at the physical point as input. 

My task in this diploma thesis is to discuss the reliability, possible obstacles and the future 
perspective of chiral extrapolations, with particular focus on the nucleon mass. How large 
are the statistical uncertainties of the fit in ref. |PHW04] ? Do we observe convergence of 
the perturbative series? Are the assumptions made well founded? Are the extracted LECs 
compatible with empirical information? Will it be possible any time soon to perform the 
fit with information purely from the lattice? This work cannot give final solutions, but it 
can present tools and strategies for the future, when more data become available. 

The following four chapters are intended to give novices a quick-start introduction to the 
mechanisms and terminology of the theoretical fields that have been brought together in 
this work: Chiral Perturbation Theory, Lattice Field Theory and error analysis. Further- 
more, chapter 4 describes the algorithms used for the statistical analysis. 

Chapter 6 constitutes the principal part and is devoted to fitting xPT to two-flavor lattice 
data for the nucleon mass. It starts out with a discussion of the existing fit of ref. |PHW04] . 
A simple numerical experiment reveals important insight about the strategy needed to 
perform ab-initio extrapolations. In an attempt to match our results to empirical pion- 
nucleon scattering measurements, we stumble over discrepancies, but we resolve the issue 



^At this stage, one can neglect the comparatively small difference between neutron and proton mass 
due to isospin breaking. 
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by taking effects of the delta resonance into account. Finally, use of a finite volume 
correction formula extends the selection of lattice data usable for our fit. 

The pion mass is a crucial input variable in many chiral extrapolations. Its precise de- 
pendence on the quark mass has yet to be clarified. Chapter 7 presents a step in that 
direction, making use of a simultaneous fit to pion and kaon mass from three-flavor lattice 
data. The chapter also demonstrates the transfer of three-flavor results to the two-flavor 

framework. 

Finally, chapter 8 gives a brief summary of my conclusions. 
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Chapter 2 



Basics of Relativistic Baryon 
Chiral Perturbation Theory 

This chapter introduces the basic concepts of xPT. The focus rests on conceptual under- 
standing and issues relevant for application. 

2.1 The QCD Lagrangian 

Chiral Perturbation Theory is an Effective Field Theory for the basic gauge invariant QCD 
Lagrangian 

■CqCD = Qfa 0ab 9fb - 9fa "^fg Qga. - -^Qp^vA (2-1) 

with flavor indices f, g = {n, d, s, ...} for the nf quark flavors, color indices a, b = 1..3, 

Lorentz indices /i, z^, ... and indices A, B, C = 1..8 numbering the generators of the color 
group. Einstein's summation convention is implied. The Dirac spinors gfa = qia,{x) repre- 
sent the quarks. The QCD Lagrangian is set up in such a way that it is invariant under 
local SU(3) gauge transformations 

^ W^ab(x) (7fb(x) (2.2) 

for any (space dependent) unitary unimodular 3x3 color matrix VFab(a^)- This is achieved 
by introducing the covariant derivative 

^^^ab gfb = dtiQis. - ig^^^^^AiuA gfb (2.3) 

where Aab,A for A = L.8 are the generators of SU(3), the Gell-Mann matrices, g is 
the strong coupling constant, and the bosonic gauge fields A^a = Aij,a{x) describe the 
gluons. Their transformation behavior is fixed by the requirement of gauge invariance. The 
propagation of the gluons is assured by the gauge invariant kinetic gluon term involving 
the field strength tensors 

Gi^uA = d^A^A - d^Ai^A + 9 /abc Ai^b AvC (2.4) 

where /abc arc the structure constants of the color group SU(3). In a perturbative picture, 
the term involving the structure constants is responsible for interaction vertices of three 
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or four gluon lines. In the low energy domain, these gluonic self-interactions abound, to 
the point where the perturbative picture becomes inappropriate altogether. 

In practice, not all quark flavors need to be considered in our Lagrangian (|2.1|) . For our 
applications, energies are far too low to excite states involving the heavy quarks charm, 
bottom or top. They can therefore be ^Hntegrated out". In first approximation, they 
only cause slight shifts in the values of the quark masses mfg and the coupling g. A theory 
with three flavors "up" , "down" and "strange" has nf = 3 and is therefore typically called 
an SU(3) theory. For SU(2) theories with nf = 2, even the strange quark is neglected. 

The quark mass matrix mfg is chosen diagonal: 

nf = 2 : m = f ^ (2.5) 

(ruu 
md I (2-6) 

Since we are not interested in isospin breaking eff'ects, we always set m„ = = rn. 

The QCD Lagrangian is a real Lorentz scalar and even under the discrete symmetries C, 
P and T. 



2.2 Chiral Symmetry 

The more we know about mutual symmetry transformation properties of the EFT's ele- 
mentary degrees of freedom, the less terms are allowed in the effective Lagrangian. This 
motivates further inquiries about the symmetries of the theory. We concentrate on flavor 
symmetries. For the purpose of clarity, we omit the color indices a, b,c. 

We define left- and right-handed {chiral) components of the quark fields = ^(1 — 75)'7f 
and = ^(1 -|- 75)(?f . In terms of these fields, the Lagrangian (|2.H) appears as 

^QCD = 'filPq^ +^iU)qf 

— — (2-'j 

So for mfg = 0, called the chiral limit, the QCD Lagrangian (|2.1|) is invariant under 
global transformations in flavor space 

qf' ^ Lfg q^ , Qf^ ^ i?fg q^ (2.8) 

The transformation matrices L and R must be unitary. If L = R = V, we will speak 
of a vector transformation, while the case = R = A shall be called an axial 
transformation. We observe that the QCD Lagrangian in the chiral limit exhibits a 
global U(nf)/, Cgi U(nf)/j symmetry. The mass term breaks this symmetry explicitly. 

The group SU(nf) has = — 1 generators Afg^F ) where F is running from 1 to np. It 
can be extended to U(nf) with an extra U(l) generator Afg^o •= ^ig- Using this notation, 
the 2nf conserved symmetry currents read 

= ? J% = ? (2.9) 



2.3 Spontaneous Symmetry Breaking 
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It is useful to define the following linear combinations 

J F = J F+J F = 1i Qg vector current 

J F — J F ~ J F — 7 75 — I — axial vector current 
The corresponding charges are 

Q'f{t) = J <fx flix,t) and Q^it) = J (fx j'^lix,t) (2.10) 

The vector charges generate vector transformations, which form a subgroup called 
U(l)y (X" SU(nf)v'. Axial charges can be used to generate infinitesimal axial transfor- 
mations. However, the axial transformations do not form a group, because commutators 
of axial charges do not yield pure axial charges again. Even though, one speaks of an 
U(l)^ SU(nf)yi symmetry. Thus in total we say that the Lagrangian in the chiral limit 
exhibits a 11(1)1/ ® SU(nf)v' and a U(l)^ SU(nf)^ symmetry. 

Only a part of the above symmetries remain valid for our EFT, even in the chiral limit. 
Firstly, the U(l)yi-symmetry is not preserved by quantization (axial anomaly), such 
that the QCD Hamilton operator Hq^^) for massless quarks is only invariant under 
SU(nf)y (8'U(l)y and SU(nf)A transformations. Secondly, empirical facts strongly suggest 
that the vacuum state does not share the Hamiltonian's invariance under S\]{n{)A- This 
phenomenon is called spontaneous symmetry breaking. 



2.3 Spontaneous Symmetry Breaking 

Candidates for vacua in the spectrum of eigenstates of -^qcd are states for which 
the momentum eigenvalue zero is discrete. This set of states can be chosen orthonormal. 
The rest of the Hilbert space can be described by an orthonormal set of states \N,p) 
carrying three-momenta p. The vacuum may be a linear combination of states belonging 
to a degenerate subset of the 

According to an argument found in ref. |Wei96j . local operators do not permit transitions 
among the l^j) in infinite volume. To see this, consider two Hermitian operators at equal 
times A(x) and B{0). Now 

{nj\A{x)B{o)\ni) = ^{nj\A{o)\nk){^k\B{o)\ni) 

k 

+ / d'pY,i^j\MO)\N,p){N,p\BiO)\ni)e-'P'^ 
^ A{Q) B{0) (2.11) 



x|— >oo 

(where we have assumed that the p-dependent matrix elements are Lebesque-integrable) . 
Causality requires the equal-time commutator [^(x),i?(0)] to vanish. Therefore 

= {nj\[Aix),B{o)]\ni) 

> V(%|A(0)|l^fc)(f^fc|S(0)|17i)-(%|S(0)|17fc)(l^fc|yl(0)|17i) (2.12) 
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The right hand side tells us, that with respect to the vacuum states, any two local operators 
commute. They can therefore all be simultaneously diagonalized, i.e. the can be 
chosen in such a way that 

(%| ^(0) l^i) (X 5ji for all local operators ^(0) (2.13) 

In particular, a small symmetry breaking perturbation built out of local operators (such as 
the mass term) will be diagonal with respect to the vacuum states, can break degeneracies 
and can thus select one of the vacuum states |r2o) as the one realized in nature. So we 
see that the vacuum is in general not a linear combination of the that forms a singlet 
under the symmetries of the Hamiltonian, but rather a state which is stable with respect 
to local symmetry breaking perturbations. 

According to |VW84j . the QCD ground state in the chiral limit must be invariant under 
SU(nf)v' (8'U(l)v {isospin). However, the possibility remains, that the state(s) of lowest 
energy are not invariant under SU(nf)^. Indeed, observations indicate that this is the 
case. 

Operators with non-trivial transformation behavior under unbroken symmetries must have 
the vacuum expectation value zero. Otherwise their expectation value would change under 
symmetry transformations - a contradiction to the invariance of the vacuum. Operators 
invariant under all unbroken symmetries, however, may take non-zero vacuum expectation 
values. An example is the scalar quark density operator qi{x)q{{x). It is invariant under 
the unbroken symmetries U(nf)v'. We switch to matrix-vector notation with respect to 
the indices f,g,... and drop position arguments: 

q={qi{x),...,qnf{x)) , q = { qi{x), qn,{x) (2.14) 

Under infinitesimal axial rotation L'^ = R = A = 1 + ieFXF/2, the scalar quark density qq 
develops pseudoscalar components of the form q^^Xpq (F = L.np) : 

qq q^AAq^ + q^A^A^q^ = m + if-F {q^Xpq^ - q^Xpq^) = m + ieFqi^Xpq (2.15) 
The latter transform as vectors under vector rotations L = R = V = 1 + ieQXQ/2 : 

gT^Apg ^ '^V^XpVq'^ - ^V^XpVq'' = q-f'Xpq + '-ec Q7'[Af, Ag]? 

= qj^Xpq - /FGHec qi^Xnq (2.16) 

and must therefore possess vanishing vacuum expectation values. Together with a numer- 
ical estimate ^, we write down the vacuum alignment conditions as 

{no\qq\^o) /ni- {270 GeVf , {QqI qi' ^Fql^o) = 0, F = l..{n^ - 1) (2.17) 

In a coordinate system with the different di-quark densities as the axes, the vacuum 
alignment condition can be thought of as an arrow, starting at the origin and pointing 
to the di-quark density realized in the QCD vacuum. Here the length of the arrow is 
the value of the scalar quark condensate (qq). It would be zero if the symmetry were 
unbroken, which is why it is called an order parameter. In principle, we could choose a 



MS, scale 2 GeV, see e.g. |(:jHROl| 



2.4 The Goldstone Boson Field 
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different vacuum alignment, shifting the non-zero vacuum expectation values to other di- 
quark densities. Then, however, we would also have to modify our definition of vector and 
axial rotations, because the vacuum must be invariant under vector rotations. (It should 
be noted that the di-quark condensates are not the only quantities that characterize the 
vacuum. Spontaneous symmetry breaking could even be realized with (qq) = 0, because 
other objects, such as four-quark condensates, may independently exhibit nonzero values.) 

Excitations of the vacuum can modify the align- 
ment locally. If we imagine di-quark density ar- 
rows of the kind discussed above at every point 
in space, we would see parallel arrows everywhere 
except in the region of the excitation, see left in- 
set. A misalignment causes tension, which leads to 
a wavelike propagation and spreading of the mis- 
alignment. The more gradual the misalignment, 
i.e. the larger the wave length, the less tension 
energy. In the limit of an infinite wave length, the 
energy of the misalignment goes to zero - the dis- 
persion relation starts out at the origin |(Teo84j . 
Thus local vacuum misalignments manifest them- 
selves as massless particles, the so called Goldstone Bosons. Their existence in a sponta- 
neously broken field theory can be shown quite generally |Wei96j . They play an important 
role in our theory, because due to their masslessness, we expect them to dominate the low 
energy behavior of QCD. In reality, with explicit symmetry breaking interactions present, 
the Goldstone Bosons are identified as the lightest particles in the spectrum of QCD. 
For rif = 2, these are the pions vr"*", vr", vr*^. For nf = 3, one includes additionally light 
excitations involving strangeness: K~^, , K^, and r/. 
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2.4 The Goldstone Boson Field 



To explore the dynamics of such vacuum misalignments, we must single out local axial 
rotations from the fields. We write down the quark fields in the form 



L(x) 



( \ 



R{x) 



( \ 



(2.18) 



with unitary rif x ?if flavor rotation matrices L(x), R{x). These can be expressed as 
products of a unitary axial rotation u{x) and a unitary vector rotation v{x): 



L{x) = u(x)^ v{x) 



R{x) = u{x) v{x) 



(2.19) 



Multiplying the second equation from the right with the Hermitian conjugate of the first 
equation gives 

R{x) L{xy = u{x) u(x) = U{x) (2.20) 
For U{x) we can read off a simple transformation law under global chiral rotations L, R: 



'{x) L q^{x) , q^{x) R g"(j;) 



-.R( 



U{x) R U{x) 



(2.21) 
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Equation (|2.18|) may now be reexpressed as 



u{x)^v{x) 



( \ 



( \ 



(2.22) 



u(x) v(x) 







= u{x) q^{x) 



(2.23) 



The new quark fields q{x) are constrained. They encode less degrees of freedom, because 
the degrees of freedom corresponding to local axial rotations have been taken out and put 
into u{x). We parametrize U{x), and along with it u{x), in terms of real fields 4>f: 



U (x) = exp ( i 



Af_0f(^ 

fO 

J TV 



(2.24) 



The ground state corresponds to U{x) = l„j and (pF^x) = 0. The meaning of the normal- 
ization constant will become clear later. The fields (Pf{x) excite Goldstone Bosons. For 
example, in an SU(2) theory, the pion fields are mapped to linear combinations of the (pF 
in the following way: 



Af (pF 



93 91 - ^92 



V2' 



-TT" 



(2.25) 



A parity transformation P exchanges right- and left-handed quark fields. From our rep- 
resentation l|2.23() . it follows that u transforms into its Hermitian conjugate under parity. 
For the Goldstone Boson fields (pF in turn, this implies a change of sign: 

P</)F(t,x)P = -(/>F(t,-f) (2.26) 

Thus Goldstone Bosons have negative parity. They form an isospin vector, because under 
infinitesimal vector transformations L = R = V = 1 + ieQXQ/2, we find 

U^VUV^ (Pf^(Pf- fFGneGcpFi (2.27) 



We could now rewrite the Lagrangian in terms of the constrained quark Fermion fields q{x) 
and the above parametrization of u{x) in terms of Bosonic fields (pi^py We are not going 
to do this, however, because we are interested in the low energy limit of QCD dynamics, 
where quarks are confined and are not the observed degrees of freedom. Instead, we start 
building an effective theory. 



2.5 Chiral Perturbation Theory for the Goldstone Bosons 

The general idea of an Effective Field Theory is to introduce fields for all excitations that 
are in resonance in the process under study. The fields come in multiplets of known behav- 
ior under symmetry transformations. In our framework, the theory contains the Goldstone 



2.5 Chiral Perturbation Theory for the Goldstone Bosons 
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Bosons (pions, ...) and the Ughtest baryon multiplet (proton, neutron, ...). Formally, the 
Lagrangian contains all interaction terms compatible with the underlying symmetries of 
QCD, where each term contains a different coupling constant. These couplings implicitly 
encode the rest of the spectrum. In xPT, the underlying symmetries are those in the chiral 
limit. Quark masses are introduced as perturbations using a trick: The quark mass matrix 
mfg is interpreted as a constant external field with a special transformation behavior. 

We start out by building a Lagrangian in the limit of vanishing quark masses with Gold- 
stone Boson fields only. Thus the only fields we take over from our discussion of the QCD 
Lagrangian are those characterizing the local perturbations of the vacuum alignment: the 
fields collected in U{x). The most general, chirally invariant, effective Lagrangian density 
with the minimal number of derivatives |Schn2j is 

( f0\2 

^{2) ^ ^ (d^Ud'^U) (2.28) 

To perform actual calculations with this Lagrangian, U is expanded in terms of the Gold- 
stone Boson fields <I>f- The prefactor (/^)^/4 in C^q^ makes sure that first term in the 
expansion is ^9^0f d'^4>F, the standard kinetic term. Why we start out with terms with 
minimal number of derivatives will become clear when power counting is introduced in 
section ITTl One always seeks to find a minimal Lagrangian. For example, a term of the 
form Tr (d^d^U U) turns out to produce the same equations of motion as the Lagrangian 
12.281 and can therefore be ignored. Note that we are not able to find a chirally invariant 
expression in terms of U fields that would produce mass terms of the form cppMpQCpQ for 
the Goldstone Bosons: Tr?7 is not invariant, TrUW = rif is constant, etc. This is the 
way it should be; the symmetry properties of the Lagrangian automatically require the 
Goldstone Bosons to be massless. 

Already at this stage, we can read off physical implications. For example, by applying 
Noether's theorem the axial current comes out as |Sch02j 

jA>^ = -iii^TT(^Xp[U,d>^U^])=-f^d>^^F + .: (2.29) 

^ {no\ j^^ \Mp)) ^ -/° d^^e-'P-- 5fg = i /° e-'^'^ 5fg (2.30) 

The axial current, an operator that induces a local misalignment of the axial degrees of 
freedom, can create Goldstone Bosons out of the vacuum! This is a feature of spontaneous 
symmetry breaking. In fact, the matrix element of a broken symmetry current sandwiched 
between the vacuum and the corresponding Goldstone Boson state is necessarily non-zero. 
It is now clear why is called the pion decay constant in the chiral limit. The 
pion decay constant Z?^^*^ measurable in experiments differs from only by corrections 
originating in the finite quark masses. 

This reminds us again that the theory we have set up so far would only apply if quarks 
were massless. There is a way to introduce quark masses without having to give up the 
tight constraints of chiral symmetry. The trick is to interpret the quark mass matrix mfg 
as an external field with the transformation behavior 

q^{x) ^ L q^{x) , q^ix)^Rq^{x) m ^ R m (2.31) 

Then obviously the QCD Lagrangian eq. (|2.7j) is still chirally invariant. Where does the 
quark mass matrix show up in the effective theory? Every combination of U and m fields 
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is allowed that transforms as a singlet under the symmetries of the Lagrangian. At lowest 
order in m, we can add a term of the form 

Cfl = il^^ Tr (mU^ + Um^^ (2.32) 

where S is a low energy constant. As such its value is not predicted by the effective theory 
itself. B is related to the quark condensate, which can be seen by comparing the vacuum 
expectation value of the mass term in the effective theory with the mass term in the QCD 
Lagrangian: 

^''=' ^^^^Tr(m + mt) {f^f B Trim) (2.33) 



{-qmq) = -mfg (gf^g) 



-mfg — Afg,o (qq) = — - Tr(m) (qq) (2.34) 
nf nf 



In the last line we have made use of {q Xp q) = for F = L.np, which follows from the fact 
that these objects form a vector under SU(nf)v'. Equating eq. (|2.33j) and (|2.34j) yields 



M 

nf 



(2.35) 



We read off the leading order Goldstone Boson masses from the terms of second order in 
the i.e. we bring the Lagrangian into the form ^ = ••• + 4>'F ^FG ^^G + ••• • In our 
case mpG is already diagonal, and by going back to the definitions of the particle fields 
one arrives at leading order particle masses 



2 


= 2Brh 


for 


nf 


= 2,3 


m\ 


= B [m + nis) 


for 


nf 


= 3 




= 1 B (rfi + 2ms) 


for 


nf 


= 3 



(2.36) 



These equations, referred to as the Gell-Mann-, Oakes- and Renner- relations 
(GMOR), are of central importance to our discussion. The fact that the different sub- 
types of pions TT^, vr^, and kaons K^,K^, K come out at equal masses is a consequence 
of neglecting differences in m„ and on the one hand and ignoring the electromagnetic 
self-energy contribution on the other hand. 



2.6 Adding Baryons 

For baryons, we need in the SU(2) theory Fermion fields for the proton-neutron doublet, 
and in the SU(3) case Fermion fields transforming as an octet. For simplicity, let us 
consider the SU(2) case, where we have a doublet of Fermionic baryon fields 




(2.37) 



2.6 Adding Baryons 



17 



We can look at the quark fields in section 12.41 for inspiration. Remember that we have 
completely moved the axial degrees of freedom of our theory into the fields u{x). So the 
excitations produced by ^'(x) should be devoid of axial degrees of freedom. For the purpose 
of constructing the Lagrangian, however, it is also helpful to introduce "unconstrained" 
baryon fields ip{x). In analogy to eq. p.2.Sj) . we decompose the fields into left- and 
right-handed components and write 

i/j^ = u^^^ tp^ = u^^ (2.38) 

We assign the usual transformation behavior to the unconstrained fields 

^^J^{x) L^^{x) i;^{x) Rip^{x) (2.39) 

Looking at equations (|2.19|) and (|2.23|) . we see that the axially constrained baryon fields 
^'^(x), ^'^(x) and thus ^{x), like the constrained quark fields, transform as isovectors 

L = R = V (x) ^ V ^'(x) (2.40) 

but remain invariant under pure axial transformations = R = A. Thus ^'(x) meets 
our requirements as a representation of the baryon doublet.'^ From the chirally invariant 
structures 

^[/t^« = '^^« = + ^ $u^^ (2.41) 

we assemble the lowest order Lagrangian of pion-nucleon interaction: 

C^^l = i^(^ + ]^{u$v) + v)$u) + {g\)'^{u$v) - v)$u)^ ^' + mo"** (2.42) 

(A term 7^ ^1/ $ is forbidden, because the Lagrangian must be of positive parity). There 
are two LECs in this Lagrangian: the nucleon mass in the chiral limit mo,^ and the axial- 
vector coupling constant in the chiral limit 5^. The latter owes its name to the form of 
the axial-vector current of the nucleon 

(7V(p)| \N{p)) = u(p)g^jf^^,^uip) (2.43) 
For massless quarks, gA assumes its chiral limit value g\ |Schn2j . 

Spontaneous symmetry breaking can account for the absence of parity doublets in the 
spectrum. We can now understand how. Without spontaneous symmetry breaking, it 
would be the operators tp^ which would generate the nucleons. Consider a nucleon state 
\Af,+) = ip'^ |0) = ■0'^^ + V'^^ |0) of positive parity. An infinitesimal axial rotation, e.g. 
A = exp(2zeQ^), transforms ip according to 

^^P^ - ieXitp^ ^^^ + ieXitp^ (2.44) 



^The explicit transformation behavior of Vl/(a;) is complicated and non-global due to a dependence on 
u{x) |Schn2| . 

^The existence of a finite nucleon mass mo in the chiral limit is related to the trace anomaly, and can 
be shown to originate to a large part from gluonic field energy |TW01| . 
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but leaves the vacuum |0) invariant by definition. Thus the transformation introduces 
a nucleon component \Af',—) = Xi{ip^^ — tp^'^) |0) of opposite parity. Since Lagrangian 
and Hamiltonian are invariant under axial transformations, |AA, +) and \M', —) have the 
same energy and can both be excited. So in a world symmetric under SU(nf)yi, one would 
expect to find nucleon states of both parities at the same energy in the spectrum. This is 
not what we observe. 

Due to spontaneous symmetry breaking we have a different picture of the world. What 
we call nucleons are excitations of the QCD vacuum by the constrained nucleon operators: 
\N,+) = ^ \Qo) = ^^"^ + \0,o). Attributing positive parity to the vacuum \0,o), the 
nucleon |A^, +) has positive parity as well. An axial rotation like A leaves ^ invariant 
per definition.^ However, it does not leave the vacuum invariant. In the chiral limit, 
A ir^o) = l^^i) has the same energy as |r2o) and still has the discrete momentum eigenvalue 
0. It must be another vacuum state. Then an axially rotated nucleon state looks like 

\N') = A\N,+) = ^A\no) = '^\ni) (2.45) 

It is an excitation of a different vacuum. In eq. (|2.13l) . we saw that no local interaction 
operator can mediate the transition between different vacuum states. Therefore, |A^') 
cannot be excited in nature. Spontaneous symmetry breaking constrains the spectrum. 
The axial degrees of freedom of states can only be modified locally, giving rise to Goldstone 
Bosons rather than axially rotated particle states. 

2.7 Higher Orders and Power Counting 

In the spirit of effective field theories, we should add to our Lagrangian all terms which 
share the symmetries of the QCD Lagrangian. An example is the purely mesonic term 

O4 = L4 Tr (d^Ud^U"^^ Tr [mU'^ + Um^^ (2.46) 

with a low energy constant L4 in front. Obviously, there are infinitely many such terms. 

Weinberg's power counting scheme offers a way to assign a formal order to a Feyn- 
man diagram. We expect diagrams of higher order to be suppressed. The amplitude 
^A{pl,p2, ■■■,Ta) of the diagram depends on "small" external momenta and quark masses 
m. Formally, the expansion of xPT is performed around vanishing quark masses (chiral 
limit) and vanishing momentum transfer (low energy limit), so we analyze the order of a 
diagram by substituting scaled momenta pi — > tpi and quadratically scaled quark masses 
m — > t^m. Counting the quark masses quadratically is inspired by the GMOR relations, 
which tell us that the leading order contribution to the Goldstone Boson masses is pro- 
portional to the quark masses squared. Then from 

M{tpi,tp2, ...,t^m) = M{pi,p2, m) + 0(f^+i) (2.47) 

we read off the chiral dimension D of the diagram. The meson momenta of in- and 
out-states are small quantities, because they vanish in the low energy limit of the chiral 



*In our new picture, A acts on the Goldstone Boson field U{x), as can be seen from the axial current 
eq. H2.291 in the effective theory. 
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limit. They appear directly as arguments pi of Ai in eq. (|2.47)) . External baryon momenta 
Pj cannot be treated as small quantities. We should rather decompose them according to 
Pj = niQ ■ V -\-pj, where u is a timelike unit four- vector, e.g. the four- velocity of the center 
of mass. The pj are again four-momenta that can be treated as "small", and appear as 



arguments of Ai in (|2.47j) . 

To simplify notation, the Lagrangian is organized in the following way 

C - 6^'^ + £(2) + £(2) ^(3) ^(4) ^(4) ^2 48) 

Contributions Ct^tt contain terms made up of Goldstone Boson fields only, while the con- 
tributions Cj^M contain interactions with baryons. The superscript indicates the chiral 
dimension of the terms. 

For example, the term O4 from (|2.46|) is part of cl^}, because a tree level amplitude 
M = (out I O4 I in) is proportional to the quark masses m (yielding two chiral dimensions) 
and to two external momenta (yielding one chiral dimension each). The two external 
momenta appear in the amplitude because of the derivatives of Goldstone Boson fields in 
the O4 vertex. 

We have already specified the lowest order contributions C^^j^ and C^} = C^q^ + C^^^ 
of the Lagrangian in eq. (|2.42|) . (|2.28j) and (|2..S2j) . At higher orders, the Lagrangian 
becomes quite long, and the number of independent LECs explodes jBCE99j . |FetOOj . 
Here, we quote the terms needed for an SU(2) calculation of the nucleon mass to order p^ 
|PHWn4] . We make use of the following abbreviations 

^^^^^^ 

X = 2Bm = I2 frT^ x± = u^X'^^ =t ux^u (2.49) 

1 
2 

where A is any object that transforms like ^, e.g., . From the Lagrangian, the terms 
displayed below contribute to the nucleon mass: 



Dfj_A = dfj_A + - [u^d^u + ud^v)\ A u^ = i (v}d^u — ud^v)\ (2.50) 



6^1 = sL^Tt{d^ud^^u)+^-^Tt[xU^ + Ux 

= ^'(i7^I)'^-mo)^ + ^5^^'7M75n'^^, 

= ClTT{x+)^^-^T,{u^u,){^>D^'D'^> + \l.c.) + ^T,{u^u^')^>^' + .. 

L%\ = e3siTrix+)f^^ + ^T^ixl-xlm 



Tt{xl)-{Tr{x-)f + TTixl)-mx+)f)^^ + --- (2-51) 

(3) 

Cj^^ does not contribute, ci, C2, C3, ess, ens, eng are new LECs. 

One can derive a formula to quickly read off the chiral dimension of a diagram |Sch02j . Let 
Nl be the number of independent loop momenta, In be the number of internal nucleon 
lines, Ni'^^ be the number of vertices from C^"^ and be the number of vertices from 
C^^^ in the diagram, then 

00 00 
D = 2NL + lN + 2 + J2'^{n- l)iV(2«) + ^(n - 2)ivS (2.52) 

n=l n=l 
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Figure 2.1: Example of a meson loop diagram. 



Obviously diagrams with loops are automatically of higher order. However, for this nice 
power counting scheme to be applicable, renormalization in the baryon sector has to be 
handled with special care, see section r2. Ill 

Equipped with a power counting rule we can calculate amplitudes in perturbation theory 
to a given chiral order p^, where p symbolizes "small" momenta or masses. 



2.8 Propagators 



We read off the lowest-order Feynman propagators from the free field parts of the La- 
grangian, e.g., for a pion of momentum k 

iSpik) = \ — (2.53) 

and a nucleon of momentum P 

iS^ (P) = ! (2.54) 

Using these propagators for internal lines, we now have almost all the tools at hand to 
calculate arbitrary diagrams within xPT, which is done as in any other Quantum Field 
Theory. 



2.9 Renormalization 



Let us take a look at one of the diagrams resulting from an interaction term of the La- 
grangian in SU(2) 

{Mp)\ TCfl \Mp)) = - + '^g cpR cPn \Mp)) +■■• (2.55) 

<^H</'H is a sum of meson loops - here pions, see fig. 12.11 Thus an amplitude involving the 
diagram above contains a term proportional to 

"^'^ ' ^I. (2.56) 



(2^)4 k'^-ml + iO+ 



The integral over all four-momenta k is not defined, because it diverges for large k. 
The problem can be solved using dimensional regularization, which replaces the four- 
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dimensional integral by a (4 + e)-dimensional one ^: 

1 f d^+'k i 



/^(e,A) ^ ^1 



(27r)4+^ k'i-ml + iO^ 



A has unit mass and is needed to give I-K{e, A) the correct, e independent units. It is called 
renormalization scale. In principle, A can be chosen arbitrarily. 

The resulting value of the integral has a pole proportional to 1/e. If the LECs were 
all finite, the divergence 1/e would end up in the final results for physical observables. 
This problem can be cured if we allow the LECs to have poles at e = 0. In our case 
— > 0,A) oc m^, so in accordance with the power counting rule (|2.52|1 our diagram is 
of chiral order p^. If we equip the LECs in cl^} with appropriate divergences, tree graph 
contributions from C^J can cancel out our loop divergence. Thus a set of terms in the 
Lagrangian acts as counterterms, balancing the infinities of the loop integral. Since 
an effective theory already contains all terms conforming to the symmetries, structures 
able to act as counter terms are available from the start. The finite remainder of a LEC, 
carrying information of physical relevance, is expressed in terms of a renormalized coupling 
constant. 

Let us look at our situation in mesonic xPT. Here, we can apply the following prescription 
for the one- loop renormalization of a low energy constant Li, now formally e- and A- 
dependent: 

L,(e,A) = L[(A) + C,A(e) + 0(e) (2.58) 
where Ci is a fixed constant which encodes the strength of the pole, and 

A(e) ^ - ^(ln(47r) - je + 1)) (2.59) 

carries the singularity. The renormalized coupling constants -^^[(A) are finite. These are 
the values that can actually be extracted from fits to experiment or lattice calculations. 
For our discussion it is important to keep in mind that renormalized LECs determined 
in different analyses are only comparable when the renormalization scale A is the same. 
Demanding A-independence of the final amplitude in leading order of e yields the transition 
rule from one scale Ai to another A2 : 

Lr(A2) = Lr(Ai) + ^ln(^) (2.60) 

Through renormalization of the LECs, in any physical observable, the divergent part of 
/^(e, A) will be cancelled. In our example, only the finite, renormalized part 

I.(A) ^ Ue, A) - 2m2 A(6) + 0{e) = ^ In ^ (2.61) 
enters the final expression. Thus loop graphs give rise to the typical chiral logs. 



^How these integrals are defined and calculated can be found in the literature |Mut87l|Sch02| . 
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The chiral log ln(m,r/A) vanishes if we set A = m-,^. This is a convenient choice as long 
as the pion mass remains fixed throughout the discussion. In the SU(2) mesonic theory, 
where niTr is the only mass scale, one therefore defines for the LECs li of C^} 

l = ^^^ll{fn^) such that l-{X) = -^(j, + ln(^^^^ (2.62) 

The ji correspond to the Cj in eq. (|2.58|) . In terms of the Ij, formulae for mesonic ampli- 
tudes are free of chiral logs, as though renormalization had never been an issue. However, 
the LECs are now implicitly m7r-dependent. For our purposes, this is not acceptable, 
because for us encodes the quark masses via GMOR. We want to study quark mass 
dependence of observables, keeping the LECs fixed. 

The integral 1)2. 56(1 was originally undefined. Renormalization assigns a value /7r(A) to 
it that captures the dependence of the integral on the external variables in the valid 
regime of the theory. The method of choosing such a value is not unique, so that various 
renormalization schemes at arbitrary renormalization scales describe the same physics (up 
to higher order effects) . Checking whether divergences and scale dependences cancel in the 
final result offers a way of validating internal consistency of the renormalization scheme. 

The loop graph discussed in this section is of a particularly good nature. It results from 
a term at chiral order and produces a single contribution at a higher order, namely p^. 
Thus the loop is automatically suppressed in the chiral expansion. This is a pre-requisite 
if we want to terminate the series at a specific loop order. 



2.10 Chiral Scale and Natural Size 

Putting (IT^ and (07)T|) together, the factor l/^iirf^f appears in our final loop ampli- 
tude. This factor is generally found in meson loops, and inspires the estimate of the chiral 
scale 

Ax ^ 47r/0 (2.63) 

This choice of as the symmetry breaking scale and the "dimensionful parameter 
that suppresses non-renormizable terms" was originally motivated in ref. |MG84j . where 
it also served as a physically sensible cutoff. The value A^ 1 GeV can be brought into 
direct connection with empirical observations as well [DG.H84]- 

Loop contributions are suppressed by a factor of the order p/A^ per chiral order. It is 
sensible to expect this also for the contributions from structures in the Lagrangian that 
constitute counterterms of loops. In order to treat all interaction structures on an equal 
footing, one may come to the belief that, at an appropriate renormalization scale A, all 
terms in the Lagrangian should follow this rule. This has implications for the expected 
magnitude of LECs, and is called the natural size argument. Consider, as an example, 
leading order tree level contributions in SU(2) from two sample terms 

£S = ci Tr(x+)^^' + . . . = ci Afnl'^^ + ... (2.64) 
Cl% = 638 (Tr(x+))' + . . . = 638 16m^^^ + . . . (2.65) 
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Identifying in the above expressions with a quantity of order 



we see that requiring 



0(e38) = -rTO(ci) 



(2.66) 



estabhshes "natural size" behavior of the term proportional to 633. With ci ~ — lGeV~ 
and allowing for some variations, we have in general a quick-and-dirty rule of thumb, 
stating that (in our notation of the Lagrangian) couplings should take values at scales 



where D is the chiral dimension at which they occur in the Lagrangian. 

Going even one step further, we may think of a probability distribution (in the Bayesian 
sense) according to which we find the LECs distributed. The terms in the Lagrangian 
are an effective description of QCD, summing up various underlying elementary processes. 
Thus within this line of thinking, it is sensible to assume the LECs follow a Gaussian 
distribution. 

Personally, I do not put much confidence in constraints following from natural size con- 
siderations. 

2.11 Infrared Regularization 

Loop diagrams containing baryon lines are less good-natured. It can be shown (e.g., 
|BL99j ) that in general all loop diagrams can be expressed in terms of scalar integrals 
containing baryon and meson propagators only. As an example, the following typical loop 
integral needs to be evaluated in the process of calculating the nucleon mass to order p^: 



where P is the nucleon momentum, with P^ — ttiq = 0{p). 

Calculating such integrals using straightforward dimensional regularization gives results 
that are not of higher order. Consequently the power counting rule (|2.52|) is not valid, 
because loops are not suppressed. This means that in principle loop graphs must be 
summed up to all loop orders at a given chiral order. Responsible for this unpleasant 
behavior is the appearance of the baryon mass rriQ in the integrands. It is a constant of 
the order of A^, and must be treated as a quantity of chiral order p^ . 

One way of getting around this problem is the heavy baryon formalism (HB^PT), cor- 
responding to an expansion of relativistic effects in powers of l/rrio in a suitable inertial 
frame of reference. The method avoids parameters of order p^ in the propagator, making 
power counting according to (|2.52j) possible. However, the convergence properties of the 
expansion are known to be bad in some important cases pL99 . 

A more recent approach to the problem is called infrared regularized or relativistic 
baryon chiral perturbation theory, was first described at one-loop order in ref. |BL99j 
and has been extended to arbitrary loop order |S(TSn4j . It is a variant of dimensional 
regularization applied to the original, fully relativistic Lagrangian of xPT. 



3 ... 3 GeV 



D+l 
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In section [2.91 we had an integrand containing only one fixed Lorentz-invariant quantity, 
m,r- Setting A = rnT^, the loop integral vanished, but then the LECs were implicitly m7r- 
dependent. The baryon propagator introduces new Lorentz-invariant quantities, one of 
them being the nucleon mass niQ. We would not mind if the renormalization of other 
LECs depended implicitly on ttiq. The LECs are fixed properties of the ground state of 
the QCD Hamiltonian in the chiral limit; the interdependence of the LECs is not an issue 
here. In contrast to m-j^, which encodes the strength of our external quark mass field, mo 
is not a variable. Can we adapt the A = mTr-trick to the baryon sector? 

Infrared regularization can be thought of as an implementation of this trick in the general 
case, when both ttiq and "small" parameters of the order p occur in the integrands. For 
the integral i^A^Tr we may introduce dimensionless parameters of order p 

r._P'^ - ml 

a = , p = 7^—^ (2.68) 

772-0 

Working with the Feynman-parametrization, the integral becomes 

H^e, A) = {^y^^^ I' dz {z\0 + 1) - z{P + a') + a') 



(2.69) 



Our task is to capture the behavior of the integral for a, <C 1. In the limit a,/3 — > 0, 
the integral does not vanish 

T^Niri^, A) is the culprit in the integral value, spoiling power counting with its contribution 
of order p^. Yet, it does not depend on the variables of order p. Therefore it is called in- 
frared regular. It can easily be absorbed by counterterms. In fact, the form of TZ]s[Tt{€, A) 
suggests that it can be canceled by a suitable renormalization of mo itself. On the other 
hand, the remaining part of the integral 

iNnie, A) = HnA^, A) - 7^7v^(e, A) (2.71) 

turns out to be of order p and respects the power counting rule eq. 1)2. 52() . 

Now renormalization is performed in two steps. In the first step we renormalize the LECs 
in such a way that in all loop integrals terms like TZisf-^^e, A) are completely compensated for 
by counterterms. In other words we define the renormalized value of our "culprit" term to 
heTZjsfTj = 0. Note however, that now mo enters our definition of the renormalized coupling 
constants. Thus we have an mo-dependent renormalization scheme. Power counting is 
violated, therefore this part of renormalization must be performed to infinite loop order. 
However, we never need to do it explicitly. From now on, whenever we speak about a 
LEC Li, we actually refer to its self-consistently renormalized value after this primary 
implicit renormalization step. Since the renormalized value TZnt^ of the integral is A- 
independent, so are the renormalized coupling constants Lj. At this stage, the loop integral 
only contributes with T7V7r(e, A). This remaining part still carries divergences, which are 
treated explicitly in a second, mass independent renormalization step, in the same way as 
in the mesonic sector. 
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The choice of TZn-k{^-,)^) above corresponds to the simple method described in ref. 
|FGJS03j . In ref. |BL99j Becher and Leutwyler actually propose a (more involved) proce- 
dure of determining the infrared regular part. ^ The result is of the form 

RnA^. A) = (^) ' ^^0Lg{a, /?, e) = TZnA^, A) + 0{a, (3) (2.72) 

Here, 5 is a function which can be expanded in an ordinary power series in a and (3. 
Again, RNn{^,^) can be absorbed by counter terms, thanks to the properties of g. The 
remainder Ijv,r(e, A) = Hi\iT^(e,X) — RNTt{e,\) is called the infrared singular part and 
respects the power counting rule eq. (|2.52|) . The advantage of the method is its simple 
relationship to HB^PT. Expanding an expectation value calculated in this framework in 
powers of 1/mo and truncating the series at a given order produces the corresponding 
expression obtained within HBxPT |BL99j . Thus infrared regularization can be seen as a 
way of doing HBxPT to infinite order in 1/mo. In practice, infrared regularization offers 
a way to directly calculate the infrared singular part /jv7r(e. A). The infrared regular part 
plays no role in the calculation and can be discarded without ever being evaluated. 

In summary, infrared regularization restores a valid power counting. This is important, 
because otherwise there would be no concept of order in the chiral expansion. If one 
works with an invalid power counting scheme, going to the next "order" changes the 
renormalization scheme of the LECs, making numerical results obtained at lower order 
worthless. 



2.12 Calculating the Nucleon Mass 

This diploma thesis makes heavy use of a formula for the nucleon mass calculated within 
infrared regularized xPT in isospin symmetric SU(2). In the chiral limit, the nucleon mass 
is mo. This is ensured by infrared regularization. At non- vanishing pion/quark masses, 
the nucleon mass can be read off from the pole of the nucleon propagator ^ 

iA{P) = J d^xe-'^-^no\T'i>{x)'^{0)\no) (2.73) 

Making use of the nucleon free field propagator, a perturbative expansion yields 

00 / • \ n 

iA(P) = -r -y -^S(P)- =-r (2.74) 

J* - mo + iO+ ^ V - mo + iO+ y J* - mo - S(P) + iO+ 

The self-energy S(P) is the amplitude of all one-particle-irreducible diagrams of a prop- 
agating nucleon. Lorentz invariance requires that S(P) can be written as 

j:{P) = mog{P^)-ff{P^) (2.75) 



®The infrared regular part 7?.]V7r(e, A) of ref. |FG,TS03] could be called "minimal", in the sense that 
only terms violating the power counting are included. In contrast, the infrared regular part _Rjv,r(e,A) 
of ref. |BL99) could be called "maximal". Becher and Leutwyler obtain it by modifying integration 
limits. Alternatively, the integrand in eq. H2.69|l can be expanded in terms of the small quantities a, 
p. Interchanging summation and integration and adjusting e as needed for each term produces a series 
expansion of this maximal infrared regular part |SGSn4) . 

'^iA(p) is actually a matrix in flavor space. We can ignore this, since we neglect isospin breaking 
(m„ = nid). 
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The nucleon is on the mass shell when — uiq — S(P) in eq. (|2.74)) is a singular Dirac 
matrix, and at rest when P = {rriN, 0, 0, 0)^. Hence uin^^ (l + f{m?j^)^ — mo (l + g{m?^)) 
must be singular. The negative solution belongs to the anti-particle and is ignored. Then 



mN (l + f{m%)) = mo (l + g{m%)) 
Thus the nucleon mass is calculated in the following steps: 



(2.76) 



Evaluate the self energy S(p) to the desired order in p. 
Renormalize loop diagrams in S(p) using infrared regularization. 
Decompose S(p) according to eq. (|2.75|) . 
Solve eq. (|2.76jl for mAr at the desired order in p. 



Here, we study the nucleon mass formulae up to order p^ given in |PHW04] . Using the 
Lagrangian as quoted in section 12.71 the authors of |PHWfl4] find the following mass 
formulae: 



to order p"^ {leading order, LO): 



m 



(<2) 



N 



mo - Aciml , 



(2.77) 



to order p^, {next to leading order, NLO) 



m 



(<3) 
N 



mo - 4cim^ + 
3(5^)^ 



\2 



m^ 



167r2(/0)2 - 



ml 
4mQ 



arccos 



m^\ 
2mo) 



m„ 



(2.78) 



to order {next to next to leading order, NNLO), truncated at m^: 



m 



(<4-) 
N 



mo — icim: 



^2 3(5^)^ _ 



m^ 



+ 



(A) 



327r(/0)2 - 

3 /(^ 

mo 



647r2(/0)2 



£2 

2 



327r2(/0)2 



2567r(/0)2m2 



mo 



8ci + C2 + 4c3 ) In — 



m„ 



(2.79) 



Thanks to the application of the renormalization procedure described above, all these 
formulae are independent of the renormalization scale A. As we have seen, this may require 
the inclusion of diagrams from higher chiral order to provide appropriate counterterms. 
Indeed, at order p^, some linear combination of 633, ens and eng from C^^j^ is needed to 
compensate the divergence of the loop in fig. 12.21 (a). We have called the corresponding 
combined renormalized coupling constant e^'^^(A). At order p'^, contributions from C^^j^ are 



2.13 Pion-Nucleon Sigma- Term 
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Figure 2.2: One-loop graphs of order (a) and (b,c) contributing to the nucleon 
self-energy. The solid dot denotes a vertex from >C^^^, the diamond a vertex from C^^]yf. 



fully included, and result in a term proportional to ei = — IGess — 2eii5 ~ 2eii6, which acts 
as a counterterm to all three diagrams in fig. 12.21 We call its finite contribution e^^^(A). 
The necessity to include counter terms at order p^ would have arisen if the authors of 
|PHWfl4] had not chosen to truncate their formula at order m^, which has been checked 
to be an adequate approximation. 

Note that even though e^^'' (A) and e[^^ (A) appear at the same order in the mass formulae, 
they may belong to different combinations of coupling constants and exhibit a different 
A-dependence. This will be a complication in our convergence analysis, see section 16.2.61 



2.13 Pion-Nucleon Sigma- Term 



A quantity of interest closely related to the nucleon mass is the pion-nucleon sigma- 
term 

GN = {N\ muuu + mddd \N) (2.80) 

It is a measure of explicit symmetry breaking, giving the contribution of the light quark 
mass terms to the nucleon mass. 

From the Feynman-Hellmann theorem, also employed in |PHW04] . a reasonably accurate 
expression for in xPT can be obtained via 

~ 5^ (2-81) 



Thus all we have to do to produce a value oi is to form a derivative of our nucleon 
mass formula at the physical pion mass rr^^^ . The result can be compared to values in 
the literature. 



2.14 Other Frameworks 

Chiral perturbation theory is a very flexible tool. The choice of fields as well as the method 
of evaluation can be adapted to the problem under study. 

For an analysis of the quark mass dependence of the axial- vector coupling g^, for example, 
the authors of ref. HPW03 find that it is vital to explicitly include the A(1230) nucleon 
excitation. They utilize HB^PT with extra fields encoding A(1230), a framework called 
non-relativistic Small Scale Expansion (SSE) |HHK98j . The expansion parameters. 
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collectively called "small scale" e, now include the mass difference ttia — ttt-tv ~ 0.234 GeV 
of the A(1230) resonance and the nucleon. 

xPT can also be adapted to the special needs of lattice extrapolation. For instance, 
converting meson loop integrals into finite sums can account for the effects generated by 
the finite box size of the lattice volume, see section HTH Even the side effects of a particular 
lattice action can be taken into account, like taste violations from staggered Fermions, see 



e.g. ref. |A+n4b . 



Chapter 3 

Basics of Lattice Field Theory 



The following chapter aims to give a quick introduction to the principles, terminology, 
advantages and deficiencies of modern lattice theory useful to know when making use of 
lattice results. The focus is on clarity rather than on completeness and accuracy. For 
details, consult refs. |Rot97[ Lep98| . 



3.1 Philosophy 

Lattice Field Theory is a way of evaluating Quantum Field Theory numerically with "brute 
force" - but it is more: It is a way of understanding Quantum Field Theory as such. 

The Lagrangian of any continuum Quantum Field Theory contains fields 0(x) and their 
derivatives c?^0(x). Remember how a derivative is defined: 

^ hm -/'(^ + «S) - <^^^) (3.1) 
a^O a 

where is a base vector of unit length {x = x'^e^). In the following, we abbreviate 
ae^ = /i. The definition of the derivative is not unique. We could equally have chosen the 
central difference 

4>{x + /i) - (t){x - fi) 
a-^b 2a 



^ hm^ '"^"^^^^7^" ^' (3.2) 



At finite a, the different representations have certain numerical advantages or disadvan- 
tages. For example, for a smooth field (j) that can be represented as a Taylor series around 
X, we have 

a^(t){x) = h 0{a) , d^(p{x) = h 0[a ) (3.3) 

a 2a 

The central difference converges at a higher order, but it extends over three points: x — fi, 
X and X + jl. 

Lattice Field Theory constitutes a formulation of the theory before the continuum limit 
is taken. Just like our trivial example above illustrates, a number of such formulations is 
possible, each one with its own strengths and weaknesses during numerical evaluation. 
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3.2 Principle 

In the path integral formalism of a continuous field theory, the expectation value of an 
observable 0[(/), 9^] is 

_ fVcpO[cP,dcP]eM^S[cl>,dcP]) 
^ ' JVcj) exp{iS[(l),d(P]) ^ ■ ^ 

where the action is 

S[<j),d(l)] = [ dx^ [ dx^ [ dx^ [ dx^ C{<j){x),d(j){x)) (3.5) 



Eq. (|3.4j) almost has the form of a weighted average over field configurations (p. The 
exponential exp{iS[(j),d(p]) yields a wildly oscillating phase factor except in the proximity 
of stationary configurations. For a numerical evaluation of the path integral this behavior 
is undesirable. Therefore, we perform a Wick-rotation of space-time in eq. 13.51 The 
integration path — cxd . . . oo of the time coordinate x^ is rotated to — ioo . . . ioo. This leaves 
the value of the integral invariant as long as the integrand has no poles in the area swept 
over by the rotating integration path. Equivalently, we can replace x^ by "imaginary time" 
x^ = ix^. In terms of the Euclidean action 

SE[(p,d(l)] = j dx^ j dx^ j dx^ j dx^ C{^{x),d(p{x)) (3.6) 

the path integral reads 

^ fV<PO[cl>,dcP]eM-SE[cl>,dcP]) 

^ ^ /P(/> exp(-5ij[(/>,50]) ^ ■ ^ 

Typically, modern lattice calculations work on a uniform lattice of lattice spacing a : 

{xi, X2, X3, ...} = { (an^, an^, an^,an'^)^ | n"^, n^, n^, G Z } (3.8) 

(Here the lower index identifies a lattice site; it is not to be confused with a Lorentz index). 
The action is now ctpproximatcci in tGrins of fiGld. values on the lattice sites Xn- 

SE[(t>,d4>] Sis.t{(f>{xi),4>{x2),(t){x3),...) (3.9) 

How this is done explicitly is discussed in the following sections. On the lattice, the path 
integral becomes 

_ /#(xi) /d(/>(x2) • • • 0((/'(xi),(/)(x2), ...) exp (-5iat(0(a;i),0(x2),...)) 
/ d(p{xi) f d(l){x2)--- exp (-5iat(</'(a;i),(/)(x2), ...)) 

So far, there are still (countably) infinitely many integrals and field variables (pixn) in the 
expression. Since computers can only work with a finite number of values, one applies 
periodic boundary conditions^: 

(j){Xn + ilatA) = H^n) for /i = 1..3 (j){Xn + Tlat/i) = ^(Xn) for = 4 (3.11) 

Typical modern lattices have Li^t ^ 30, Tiat ~ 60. We abbreviate L = aLi^t, T = aTiat. 
All in all, the calculation takes place in a box of the size Lx Lx LxT in discrete Euclidean 
space-time, compare fig. 13.11 



^or, as the case may be, antiperiodic boundary conditions 



3.3 Free Fermions 
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Figure 3.1: Lattice in 1 + 1 dimensions. 



Note that in Fourier space, only wave functions (/'(x) oc exp(z/c^x^) with discrete momenta 
k can be represented on the lattice. Due to periodicity and finite lattice spacing ^ 



k G 



27rZ 27rZ 2t\T 2t:T 



L ' L ' L ' T 



TT , TT 

- <k^<- 
a a 



(3.12) 



As we have seen in section lT^ loop integrals of the form J d^k in a continuum theory can be 
divergent. Working with a finite set of momenta, the lattice imposes automatically both an 
ultraviolet and an infrared cutoff: Since integrals appear as discrete and finite sums over 
all allowed momenta, see eq. ()3.12|) , very small and very large (Euclidean) momenta do not 
contribute. Therefore, all quantities on the lattice are automatically finite. Nevertheless, 
they need to be interpreted within this lattice regularization scheme. 
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a) ^(x,) - q(x.) 





Figure 3.2: Fermion doubling problem: a) The central difference is blind to a wave with 
wavevector vr/a. b) Dispersion relation in the continuum and on the lattice for simple 
Fermions. 

3.3 Free Fermions 



To illustrate how the discretization of the action is done, consider the action of free 
Fermions^ 

* = (5(.)7A<,(.)+5(:,0,n,(.)) (3.13) 

Replacing the integral by a sum over all lattice sites n and expressing the derivative as a 
central difference yields 



SL = X] + - (l{xn - /i))^ + q{xn) m q{xn)^ 

Writing Dirac indices as a, /3, the Fermionic action is of the general form 



(3.14) 



(3.15) 



For this simple lattice action, the integrals in eq. (|3.in|) can be carried out analytically, and 
one can check whether the dispersion relation approximates the physical one. For small 
momenta <C vr/a, this is the case. Unfortunately, at this point, one has to deal with 
an unpleasant property of the central difference used in the construction: Consider the 
configuration q{x) oc exp(iA;^x'^) for a momentum on the "Brillouin zone", for example 
with k = (vr/a, 0,0). Here, the central difference in the action vanishes for /i = 1..3: 
qixn + f^) ~ <l{xn ~ fJ-) = 0. This is due to the fact that the central difference works 
only with lattices sites 2a apart, and therefore cannot deal properly with momenta larger 
than vr/2a, see fig. I3.2t i. As a result, there are spurious low energy modes of Fermions 
with non-zero momentum on the Brillouin zone, see fig. 13.2b . In a lattice calculation, 
they would be excited along with the "true" Fermions. This is the Fermion doubling 
problem. The workarounds involve compromises: 



^wave functions 4>{x) oc exp{iki_ix'^) and cj>{x) oc exp(i(fc^ + 27r/a)a;^) have the same representation on the 
lattice. We are interested in the low energy domain; large momenta should be cut off anyway. Therefore 
we interpret this ambiguity as a boundary on our momenta. 

^We ignore here details of notation in Euclidean space 



3.4 Gluons - The Gauge Field 
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Figure 3.3: Circumventing Fermion doubling: a) dispersion relation for Wilson Fermions 
b) staggered lattice made up of coarser lattices of lattice spacing 2a, as indicated by the 
different types of nodes. 

• One could use a non-central derivative, but that destroys Hermiticity of K^J^. 

• For Wilson Fermions, an additional term 

X] ^ 9(3;) + /i) - 2g(x) + g(x - /i)) =a^q{x)aq{x) (3.16) 

is added to the Lagrangian, its strength being controlled by the parameter r. The 
operator □ is 9^9^ in the continuum limit. The Wilson term treats Fermions like 
scalar particles, and vanishes in the continuum limit. It lifts the energy of the 
spurious modes on the Brillouin zone, see fig. 13.3b ,. It also shifts the quark mass, 
which is now no longer m. Unfortunately, Wilson Fermions do not exhibit chiral 
symmetry for vanishing quark mass. A number of modifications have been found to 
restore chiral symmetry, though at considerable computational cost. 

• Staggered Fermions by Kogut and Susskind address the Brillouin-zone problem 
by assembling the lattice from overlapping coarser lattices of lattice spacing 2a, 
as illustrated in fig. 13.3b . Fermionic degrees of freedom on each of the coarser 
lattices exhibit the correct dispersion relation. However, this approach requires the 
introduction of a new unphysical quantum number: there are now four different 
quark "tastes". Therefore, staggered Fermion calculations exhibit "taste splittings" 
for example in the mass spectrum. The splittings should vanish for a ^ and are 
therefore an indicator of systematic errors. 

3.4 Gluons - The Gauge Field 

Introducing gauge symmetry of the QCD Lagrangian starts out with the wish that one 
be able to choose the "coordinate system" of color freely at every point in space-time. 
That is the meaning of the transformation law eq. 1)2. 2|) . It is possible to implement 
exact color gauge symmetry on the lattice, even at finite lattice spacing a. This justifies 
the name "lattice gauge theory" . The construction procedure on the lattice is even more 
straightforward than in the continuum. Note that the resulting gauge invariant lattice 
action differs from a lattice action derived by naive discretization of the continuum action. 
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Let us neglect flavor here, and write the quark fields as color vectors q = (^a)r=i..3- We 
have seen that free quarks on the lattice have an action of the form eq. (|3.15|) .^ We 
must simply ensure that Fermion bilinears remain invariant under gauge transformations 
(l{xn) W{xn) q{xn)- For two adjacent lattice sites x„ and x„ + /i, this can be achieved 
by introduction of a transformation matrix U^{xn) G SU(3), which specifies how the color 
coordinate systems of the two lattice sites are rotated with respect to each other. U^{xn) 
is called a link variable, and is in general depicted cLS dbH arrow connecting and Xn + fl- 
its transformation property 



Uf,{Xn) ^ W{Xn) Uf,{Xn) {Xn + (l) 



U^{Xr, 



(3.17) 



guarantees that an appropriately modified bilinear remains invariant: 

X = QaiXn) U^{Xn) QpiXn + A) ^ qa{Xn) WHxn) U^{Xn) W{Xn + A) q^iXn + fl) = X 

(3.18) 

Replacing the quark bilinears qa{xn)(lp{xm) in eq. (|3.15|) by their gauge invariant coun- 
terparts q^{xn) U^{xn) qp{xn + ft) and embedding the link variables U = (C/^(rE„))^ ^ into 
the coefficient matrix K'^, the gauge invariant Fermionic action attains the form 



'lat 



n,m,a,p 



(3.19) 



In order to relate color coordinate systems of nonadjacent lattice 
sites, we can form products of the link variables. For example, 
both U^{xn)Uy{xn + f) and Uy{xn)U ^[xn + v) describe a rotation 
of the color coordinate system when going from lattice site x^i to 
a^n + A + ^- Initially, one might require that both rotations be 
equal. Consider the product of link variables 



U^u{Xn) = U^{Xn)Uy{Xn + fj-W^iXr, 



+ z})[/t(a 




X +fl+v 



(3.20) 



which describes a closed path (a "Wilson loop") around a square of lattice sites of side 
length a (a "plaquette" ) . With our requirement, we should have U^u{xn) = 1- Actually, 
demanding this would take us back to interaction- free Fermions. So let us allow U^u{xn) 7^ 
1. In analogy to parallel transport in general relativity, this means that color space is 
internally curved. The curvature causes tension, i.e., a restoring force, if we add the 
following term to the action 



cG 
'-'lat 



J^Tr(2-l-C/^,(x„)-?7t,(x„) 



□ 



(3.21) 



where /3 is a constant, and is a sum over all plaquettes on the entire lattice. It turns 
out that in the continuum theory a link variables U^{xn) correspond to an object 



U{Xn,Xn + fJ-) =1^6^^ {^ig— J dz Af,^A{z) 



(3.22) 



^Spelling out the color indices explicitly, it is a sum of color singlets of quark field bilinears 
q^^{xn)qi3a,{xm) , relating the field value on one lattice site to the field value on another. 



3.5 The Calculation Scheme 



35 



where V denotes a path ordering. Thus Uf^{xn) actually encodes the gauge fields ^|U,a- 
Furthermore, reproduces the kinetic term of the gluon field: 

Sit^\G|^^^Ag^.uA for/?=^ (3.23) 
We now have all the ingredients for a discretized gauge invariant QCD action S\a,t = 

As already mentioned, it is possible to find improved actions which converge more 
quickly to the continuum limit, e.g., with a discretization error 0{a^). 



3.5 The Calculation Scheme 

With the lattice action, the path integral looks like 
(O) oc I [dU] I [dq] J [dq] 0[U,q,q] exp l-Sg, - ^ g,(rr„) K^[U] qf,{xrn)\ (3.24) 

Here, [c/?7] , [dg] , [c?g] symbolize integration over all degrees of freedom at all lattice sites. It is 
possible to directly integrate out Fermions, treating the Fermions as Grassmann variables. 
The result is of the form 

(O) oc y \dU\ 6\U\ exp (-51*^0 det(K[C/]) (3.25) 

The remaining integral still has a huge number of degrees of freedom of the order of 
-^fat-^lat- Its numerical evaluation is achieved by clever sampling, most prominently by 
random "Monte Carlo" methods like the Metropolis algorithm. At each sampling step, 
the algorithm produces a set of numbers specifying the values of all the link variables IJ 
on the lattice. Such a set of values is called a gauge configuration. In the end, one 
obtains a whole ensemble Eu of A'" gauge configurations which contains a configuration 
U with a probability 

P[U G Eu] oc exp (-5gt) dei{K[U]) (3.26) 

The calculation of the ensemble is very costly. In particular the determinant Aei{K\U\) is 
problematic. In the quenched approximation it is simply set to 1. This is equivalent 
to neglecting spontaneous quark fluctuations - sea quarks. Nowadays it is possible to 
take the Fermion determinant into account. This diploma thesis will make use of results 
from such unquenched (or fully dynamical) simulations exclusively. Luckily, once an 
ensemble is calculated, it can be used for the evaluation of many different observables, 
simply using the sample average 

(0)<^^J2 0[U] (3.27) 
ueEu 



The usage of samples and averages leads to statistical errors depending on the ensemble 
size as Ij^fN. 
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3.6 Extracting Masses 

In order to determine the mass of a hadron, one takes an operator which carries the 
quantum number of the desired particle. For the pion, one might take 

xHp, r) = ^ e'P-"^" Ua(^n, r)75rfa(x„, r) (3.28) 

which creates a pseudo-scalar color singlet of an anti-down up quark pair. Such an operator 
generates a whole spectrum of states, but only the pion is of interest, which is the mode of 
lowest energy.^ Consider the propagation of the hadron for a certain Euclidean time r : 

c{p, t) = (X{p, t)X^p, 0)) = J2 ^(P' 0) l«) e" ("I X^P, 0) jf^o) (3.29) 

a 

In the sum over all intermediate states \a), high energy modes fall off very quickly with 
T. In the limit of large r, only the excitation |7r) of lowest energy E^^^ contributes: 

c(p, r) PS const • e~^'^^^'^ for r ^ oo (3.30) 

The idea is to calculate c(p, r) on the lattice and to determine the rest mass m,r = 
(£;2p.-p^)i/2 of |q,^) from a fit to eq. ^^TM, at large r. To that end, one substitutes 

an operator O = X(p, t)X^(p, 0) in eq. (|3.24() . Integrating out Fermions analytically, 
one obtains the operator O in terms of the gauge fields U , which can then be evaluated 
according to ()3.27|) . Note that due to the periodicity of the lattice in the time direction, r 
must be sufficiently small compared to half of the lattice extent T, otherwise the operator 
X{p,t) will "feel" the periodic image X^{p,T) of Xt(p,0). 



3.7 Finding the Physical Length Scale 

The natural unit of length on the lattice is the lattice spacing a. For the calculation, 
it is practical to work with quantities like fields, masses, and couplings multiplied by an 
appropriate power of a, such that they become dimensionless in terms of natural units 
(h = c = 1). For example, the quark mass is parametrized by am. The fundamental pa- 
rameters of QCD become dimensionless coupling strengths on the lattice. As an example, 
collaborations using Wilson Fermions (or improved versions thereof) typically specify /3 
and K = (8r + 2am)~^ as their simulation parameters. These two numbers encode the glu- 
onic coupling strength and the quark mass. For the analysis of lattice results, observables 
must be converted back to units of the real world. For instance, a pion mass calculation 
as described in the previous section yields am,r- To translate this into m^r in units GeV, 
we need to know the lattice spacing a in physical units. 

The determination of a requires the calculation of a suitable reference observable, which 
can then be compared to an experimental result. Early analyses used the p mass as their 
reference observable. Nowadays, the static quark potential is compared to empirical data 
from heavy quarkonia, such as T, a bb system. This method is advantageous, because it is 



independent of valence quark masses, and the lattice calculation is simple B"'"f)fl From 



^In practise, it is advisable to use an operator which is not exactly point like. For such a smeared 
operator, the high energy contributions are somewhat dampened from the start. 
® Basically, the operator involved is a Wilson loop. 



3.8 Uncertainties and Artefacts in Lattice Data 



37 



the spectrum of bb states one can calculate an effective potential. The same potential can 
be explored on the lattice by placing two infinitely heavy quarks on the lattice and varying 
their distance r. Typically, a reference length tq called the Sommer scale is defined in 
terms of the attractive force F{r) between the quarks according to 

F(ro)r2 = 1.65 (3.31) 

According to phenomenological models, tq = 0.5 fm |Som94j . So if an analysis shows that 
the condition (|3.31|) is met for heavy quarks that are ro/a lattice nodes apart, we know 
that their distance equals 0.5 fm in nature.'' Thus a mass mx is converted to units GeV 
via 



mx = hc= (amx) nc = (amx) ; — 0.1973CxeV rm 

a ro 0.5 fm 



(3.32) 



The dimensionless quantities amx and r^/a are those that are commonly tabularized in 
the literature. 



3.8 Uncertainties and Artefacts in Lattice Data 

In order to make calculations more realistic, larger and finer lattices would be required 
that demand a calculation power which modern supercomputers do not yet possess. At 
present, it is important to take the following uncertainties and artefacts into consideration 
when making a physical interpretation of lattice data: 

• Deviations from the physical value due to unphysically large quark masses (if not 
accounted for by matching to a quark mass dependent theory, such as xPT). 

• Deviations from the physical value due to the finite size of the lattice (if not accounted 
for by a finite size correction within the theoretical framework, e.g., xPT). 

• Statistical uncertainties from the ensemble average eq. (|3.27|) . Usually, lattice groups 
specify a one-standard-deviation error, and the fluctuations may be assumed to 
follow a Gaussian distribution. 

• Discretization artefacts due to the finite lattice spacing a. Ideally, these are further 
reduced by continuum extrapolations, e.g. using specially adapted forms of xPT. 

• Uncertainty about the lattice scale a. Lattice collaborations do their best to estimate 
this error, see e.g. |A"*"04a . The determination of a involves 



— statistical errors from the lattice calculation of the static quark potential, 

— systematic errors from a continuum extrapolation, 

— systematic errors from the use of the quarkonia model, 

— experimental errors from the quarkonium spectrum, 

— systematic uncertainties entering due to heavier sea quarks than in nature^ 



Note that the LHPC group prefers a reference scale ri, which is defined via F{ri)rl = 1. 
* The magnitude of this artifact can be studied to some extent by comparing to the quenched approxi- 
mation |B+00| . 
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Chapter 4 

Methods of Statistical Error 
Analysis 



This section explains the statistical concepts necessary for error analysis in a formal man- 
ner. It is based on concepts found for example in ref. ED.T^Tl . Important aspects of the 
method are given in ref. jJamj. In order to make the connection to the concrete task 
clear, the text describes the form of the problem and some basic techniques for the simple 
example of a fit of the pion mass expansion of the nucleon mass to two-flavor lattice data. 



4.1 How Statistics Relates Theory to Experiment 



An experiment produces a collection of numbers y. Because of uncertainties, fluctuations, 
and other random behavior, y represents a sample of a random variable Y_. Together 
with knowledge about the experimental uncertainties, a theory provides the probability 
distribution of y . Usually, theories depend on parameters p. Thus the object encoding the 
experimental predictions of the theory is a probability density WY{y\p) for the occurrence 
of y under the assumption of parameters p. If the theory is true, there exists a ^^true" 
parameter set p*''"'', such that wyiylp^^'^^) is the probability density of y realized in nature. 




+experimental uncertainties experiment 



Figure 4.1: How statistics relates theory to experiment. 



The task is to extract an estimate p^^^{y) of the true parameters p^'^^^ given data samples 
y. We would also like to know how precise our estimate is. Finally, we want to assess the 
plausibility of our original assumption that the theory is true. 
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4.2 Definition of the Problem 

In our simplest case, we take from lattice calculations the pion mass and the nucleon mass 
associated with it. For different lattice simulation parameters, we have a set of points 

{ m.„ = Xi± Axi, rriN = yi± Ayi ) ioii=l..N (4.1) 

The Xi and yi are the numerically calculated values, and the Axj and Ayi are one-standard- 
deviation errors. For the moment, let us assume the pion mass uncertainties play no 
significant role, and that the errors in the nucleon mass are uncorrelated. 

For our purposes, the lattice data take on the role of experimental data, in the sense that 
they form a source of information independent of our theoretic calculations within xPT. 

Chiral perturbation theory provides a function m]\f{x;p) which predicts the nucleon mass 
m^r for a given pion mass x and a vector of parameters p = {pi, ..,pn)'^ , the low energy 
constants. The first task is to determine the low energy constants and their associated 
uncertainties from a fit to lattice data. A parameter set p that comes close to the truth 
should yield predictions fi{p) = mN{xi;p) close to the calculated nucleon masses yi from 
the lattice. In a symbolic notation, the conditions we want to fulfill optimally by a suitable 
choice of the parameters p are: 

"/i(p)«yi±Ayi" fori = l..A (4.2) 

Later it will turn out that much more general fitting tasks can be brought into this 
simple form. Among other things, one can treat uncertainties in the pion mass, perform 
simultaneous fits and deal with some systematic uncertainties. 

Using the fit conditions (|4.2|) one can calculate an estimate p'^^^ and its statistical un- 
certainty. Plugging the estimate into our formula from xPT, one obtains the optimal 
interpolant mN[x]p'^^^). The statistical uncertainty of the estimate can be displayed as a 
band of interpolation functions at a given confidence level. 

In the following, the statistical problem associated to (|4.2j) is rendered more precise. 



4.3 and Maximum Likelihood Estimate 

Let us assume that there are no systematic errors in lattice calculations. Were the lattice 
calculations performed over and over again, the resulting yi would be scattered around 
"true" values yf^'^- Thus the data points yi are samples of random variables Yi. For our 
analysis, we assume that the Yi are statistically independent and have normal distributions, 
with expectation values yf^^ and standard deviations Ayi. Let us also assume that the 
theory is correct, i.e., 

h{v''n = yT (4.3) 

(Whether this assumption is plausible, i.e., whether the theory appears to be true, can be 
checked a posteriori). All in all we assume that the data follows a probability distribution 

"^^^'^^ = n -p — ya^ ^'-'^ 



4.4 Construction of Confidence Regions 
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Interpreted as a function of p, wy is called the likelihood function. It is convenient to 
introduce normalized quantities Si = yi/Ayi and ti{p) = fi(j))/Ayi as well as 

N 

x'iiip) = x\i,t{p)) = - ^i? = Up)-41 (4-5) 

1=1 

such that the probability density for the occurrence of lattice results s takes the form 

ws{s\p) = (27r)-^/2 exp (^-i x'Uw) (4-6) 

For given input data s, one can maximize the likelihood 'w{s\p) by tuning the parameters 
p. The resulting parameter set p^^^ is called maximum likelihood estimate. Taking 
many different sets of input values s would reveal that p^^^ is scattered around p^^^^ . The 
numerical determination of is done by a minimization of x^illp) with respect to p. 



4.4 Construction of Confidence Regions 



Now we would like to construct a region R of parameters which is known to contain the 
true parameters p*^^^ with a given probability {confidence level) CL. ^ For a given 
assumption on parameters p of the system, we can calculate the probability density of the 
estimate p^^{S_). The first step is to define a region of parameters R{p) which contains 
the estimate p^^^ with probability CL : 



Plp'^'iS) e Rip)] = CL 



(4.7) 



Actually, we are faced with the inverse problem: p is given, and we need a region R{p^^) 
which contains the parameters of the system p^^^^ with probability CL. Defining 

i?(p-*) = {p|p-*G%)} 

provides a solution to this problem, because R{p^^) fulfills 

P [p*™^ e R{p^\S))] = P \f^\S) e = CL 



(4.8) 



(4.9) 



Now the steps above must be carried out explicitly, making use of ■ One chooses 



R{P) = {P' I ( t_ip^), m ) < XCL,n(p)} 



(4.10) 



where XcL n (p) ^ threshold value for that must be appropriately chosen to enclose 
the desired probability content CL. 



Then the error band is given by 

Rit') = {P-- X'{ t_if''), m ) < xhL,niP)} 



(4.11) 



^In a sense, we are looking for something like a "probability density wp(p|t/)" of the parameter set p 
given samples y. Such a density cannot be defined, since p is not a sample of a probability variable, and no 
probability measure exists in parameter space. However, the probability density of the estimator 
is well defined. Therefore, the construction of confidence regions relies on the estimator. 
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Figure 4.2: The probability distribution tfy(s|p) for a 2-dimensional data space and a 
1-dimensional parameter space. 




Figure 4.3: Determination of the confidence region for a 1-dimensional parameter space. 

4.5 Calculating the Threshold 

We can visualize the way the method works by looking at the N dimensional data 
space (fig. 14. 4j) . where the measured results s G IR'^ appear as a point. In data space, 
= l|o;~^ll2' i-^- simply measures the Euclidean distance between a, 5 G IR-'^. 
The function t{p) constitutes an n dimensional manifold (a "surface" ) embedded in data 
space. 

The determination of the threshold XcL n becomes simple if we make the assumption that 
this manifold is planar/flat in the vicinity of = t{p^^^^). Note that a sufficient but 
not necessary criterion for flatness is linearity of t{p) . In the applications presented in this 
thesis, the manifold t{p) is sufficiently flat. The assumption of flatness enables us to make 
the decomposition 

s = t*™^+£ + r (4.12) 

where q is an element of the vector space Vq parallel to the manifold and r is perpendicular 
to it (see fig. 113)) : 

qeVq = { {t{p) - t*''"^) for any p} , Vq®Vr = R^, reVr, r-q = (4.13) 

The position of the maximum likelihood estimate f'^^ is at the point on the manifold closest 
to the data point s 



4.5 Calculating the Threshold 
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?(s,t(E'n)= const 



Figure 4.4: The manifold t{p). 



2 ▲ 



i\ s, r ) = i\ 




Figure 4.5: Flat manifold t{p) and threshold. 



We can now determine the probability content of R{p^'^^'^) as defined in eq. 1)4. lUj) . Using 
the notation 



(4.15) 
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we calculate 
P 



[ dq [ drws{f'''' + q + r\fne{ x'{f''''+q,t''''') < X^L,n ) 

JVq JVr 



(27r)-^/2 
(27r)-^/2 
(27r)-^/2 



/ dr exp 



+ r:li: I e \\q\\^<XcL,. 



dq dr exp ^-^ H^H^^ exp ^-^ ||r||2^ G H^H^ < XcL,n ) (4-16) 



Effectively, we integrate the probability content of the region shaded grey in fig. 14.51 The 
r-integral is 



dr exp 



Vr 



1 



i-ll2 



dx exp 



1 . 



N-n 



(2vr) — 



(4.17) 



Thus 



p'''\S) £ Rip 



true> 



d(7 exp 



(27r)-"/2 

(27r)-"/2o„y^ d<7g"-iexpf--g2 
{2.)-l^^n\ j^""^ dx x^exp(-| 



(4.18) 



where = 27r'^/Vr(f ) is the area of the (d-l)-dimensional surface of the unit sphere. 
Then finally 



P 



p^^*(5) G Rip" 



2"/2r(§ 



2) JO 



XcL 



, n-i ( X 
ax x 2 exp y—-^ 



= W^5^(X^L,„) = CL 



(4.19) 



^^\xch n) cumulative distribution for n degrees of freedom. Due to our 

flatness assumption, XcLn independent of p^^^^. 

For the practical calculation of Rip'^^^), we can use the Pythagorean theorem 

\\t_ip)-r\s)\\l = \\tiP)-s\\l 



k-f'*(i)||2 



(4.20) 



to obtain from eq. (|4.1H) the prescription for the determination of the confidence region 

(4.21) 



Rir\s))^Ris) = {p\x'{s\p) < X^pt(l) + X^L,n} 



4.6 Hypothesis Testing 
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with Xopt (^) being the smallest value ( t{p) ) can take on for fixed s: 



XoptU) = {s\p^\s)) = minx^Ulp) 



(4.22) 



4.6 Hypothesis Testing 



2 i 



s 
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t(e) 
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x'(s,r'(s))<x', 



Figure 4.6: Hypothesis testing around a flat manifold 

The measure also offers a simple way to judge how plausible it is that the data s 
follow the assumed distribution around the theory prediction t{p). Again, a threshold 
obtained from the distribution is involved. It is important to note that here the relevant 
number of degrees of freedom (d.o.f.) is different than for the calculation of confidence 
regions. 



For hypothesis testing, we calculate Xmcas = Xopt(^™^'^**) for our measured sample ^'^^^ ^ 
and judge the hypothesis by the likelihood of occurrence of a smaller or equal value of 



= (27r)-^/2 f dq [ dr exp (-^ \\q + r\\l) Q ( ||r||^ < xLas ) 

= (Stt)-"^ dr exp (^-^ \\r\\l^ Q ( ||r||^ < xL^ ) 

= W^^-\xLj (4.23) 

Obviously, for hypothesis testing we need to evaluate the x^ distribution for {N — n) 
degrees of freedom, as opposed to the n degrees of freedom appearing in the error band 
calculation. 

The reason for this difference is the following: For error band calculation, q plays the 
major role because we define our error band on the manifold t{p). Conversely, for hypoth- 
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esis testing, the distance r to the manifold t{p) is relevant, because it characterizes the 
"distance to the theory". 

A value of W^^f ""^(xLas) 

very close to 1 means that the extremely unlikely accident 
has happened that the measured data are much farther away from theory than expected. 
That is - if the theory is valid in the first place. Therefore a "^(Xmeas) ^ 1 casts 

doubt on the validity of the predictor t{p) or might indicate underestimated or correlated 
uncertainties. 

On the other hand, a "''(Xmeas) close to represents a measurement that, compared 

to theoretic predictions, seems "too good to be true". Then one might want to check 
whether uncertainties are correlated or overestimated. 

For a large number of degrees of freedom (d.o.f.), the expectation value of is approx- 
imately the number of d.o.f, namely (A^ — n). Therefore, it is customary to specify "x^ 
per d.o.f." as an indicator of credibility. If Xmcas/(^ ~ ^) is much greater or well below 
1, one should be worried, for the same reasons as mentioned above. Note, however, that 
the distribution of Xmeas/i^ ~ ''^) becomes more peaked with growing number of d.o.f. 



4.7 Confidence Regions for Single Parameters and Derived 
Quantities 

Suppose we ask for the value and confidence interval of one single parameter pi, but do 
not care what values the other parameters might take on concurrently. This is a special 
case of a more general problem: Consider some quantity u £ IR™", which can be calculated 
from the parameters using a function /i : IR" — > IR™. We would like to have an estimate 
of u- How can the confidence region belonging to u be determined? 

In principle, we follow the method of construction presented in section 14.41 Through the 
maximum likelihood estimator p^^^, we have an estimate of the value of u, and vP^^{S) = 
h{p°^^{S_)) follows a known probability distribution. So under the assumption of u being 
the true value, we can select regions R[h]{u) C IR'" fulfilling 

G RMu)] = CL (4.24) 

Then the desired confidence region is 

R[h]{u^"') = {u I if'' G R[h]{u)} (4.25) 
R\h\ is constructed with the help of x^- We define level sets 

L{u) = |p I hip) = (4.26) 

and an effective measure 

Xlsiu,!^) = , min „ X'( tSp), tSp') ) (4.27) 
peL{u), p^eL{u') - — 

xlsilh) = min, xH 1 , t{p) ) (4.28) 



^The special case m = 1 and h{p) — pi amounts to the single parameter analysis. 



4.8 Errors in the Pion Mass 
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Aside from pathologic cases, for each n, M{u) = t{L{u)) is an (n — m)-dimensional con- 
nected manifold in data space, at least in the vicinity of t^^^^. Furthermore, we assume 
here that all the manifolds M{u) and the manifold swept out by t{p) are planar in the 
region of interest. For different u, the M(n) are then automatically parallel (otherwise 
they would intersect). The following definition provides confidence regions of the desired 
confidence level: 

miu) = {yf I Xcff(2i,2i') < XcL,m} (4.29) 
To see this, we make the decomposition 

s = f^"" +v + q + r (4.30) 

where v is parallel to the M{u), q is perpendicular to v but parallel to the manifold t{p), 
and r is perpendicular to both v and q, where now Vr is m-dimensional. The probability 
content is calculated in analogy to eq. (|4.1 

P 



JVs 

(27r)-^/2 1^ dvj^dqj^dr exp (^-i;^2(^true ^ ^ ^ ^ ^ ^^^truc) ^ q ||^||2 < ^2^^ 

{27:)-''/'' j^dq exp(-^||g||^)e( ||g||^<x^L,rn) j/^ j/^- ^^v{-\\\r + vf^ 
(27r)— /2 dq exp {-\ 6 ( ||^||^ < xIl, 



= W^^\xli.,J= CL (4.31) 
Making use of the Pythagorean theorem again, we find 

R[h]{u^\s)) ^ R[h]{s) = [U I Xe\ ( U ) < xlL,rn] 

= {u\xl^{r\s),u) < xlL,m} 

= {u\ Xlsi S\ u) < XoptU) + XCL,m} (4.32) 

= {hip) I {s\p) < Xopt(l) + XcL,m] (4.33) 
where again Xopt(l) = X^ {f''\s),s). 

Thus we deduce the following simple procedure to find the confidence region for u: De- 
termine the confidence region in parameter space using a x^ threshold for m parameters, 
and calculate the image under h. 



4.8 Errors in the Pion Mass 

In section the errors in the pion masses m.,^ = Xj it Axi were mentioned, but only now 
will it be shown how they can be taken into account. To get a feeling for the problem, let 
us look at an intuitive solution: In analogy to eq. (|4.2j) . the constraints take the form 



" mN{xi ± Axi] p) ^yi± Ayi 



(4.34) 
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The standard method of linear error propagation tells us how the error in Xi contributes 
to the error in m^: 



^ (- \ 

OXi 



Axi ^yi± Ayi " (4.35) 



If the errors in Xj and yi are independent, the two contributions to the error may be added 
quadratically: 



d 



mNixi]p) ^ yi± ^ {Ayi)'^ + \^—mN{xi]p)Axij " (4.36) 

It will be shown that a more careful analysis of the problem gives the same result. 

It is possible to specify an optimization problem of the form (|4.2|) which accounts for 
the errors in Xj. We introduce a new parameter Xi for each of the N data points. The 
complete set of optimization parameters is now p = {ui, ...,Un, xi, ...,X]\[)'^ ■ Here the Ui 
denote the parameters previously denoted pi and control the shape of the interpolation 
function mN{x]u). The fit constraints can then be specified as 

" fiip) = mN{xi; u) K, yi ± Ayi i = l..n " , , 

fi+n{p) = Xi ^ X, ± Axi i = l..N" ^■'"^ 

Using this notation, the statistical toolkit presented in the previous sections can be applied. 
However, it is neither practical to have so many parameters, nor are we really interested 
in the values of the Xi . Luckily, they are tightly constrained by the linear conditions in the 
second line of eq. 1)4. 37() . Making an approximation, the parameters Xi can be eliminated 
from the problem, according to the following general procedure. 



4.9 Eliminating Linearly Constrained Parameters 

Suppose we have n = m + k optimization parameters p = {ui, ...,Um,xi, ...,Xk)^ and a 
problem of the form 1)4.2(1 . The Xi are to be eliminated. The constraint functions /j must 
be approximately linear in the x^: 

ti{p) = 4^ ~ A{u)x + gi{u) (4.38) 
Ayi 

Here A{u) G iR^x'i: jg ^ matrix. If the only quantities we want to analyze statistically are 
the Uj, we can apply the recipe of section H771 According to eq. (|4.32() . the confidence 
region is given by the condition Xeff(^i 2^) — Xopt + Xxjl m ■ i^sA, needs to be done is 
to find an approximation for Xeff; which is given by 

XeffU)!^:) = min ||^(w)lE + 5((m) — sII^ (4.39) 
With the abbreviation b{u) = s — g{u), we are left with the famous problem to minimize 

1 1 2 

\\A-X — b\\2- It can be solved by a matrix decomposition 



A = Q R 



(4.40) 



4.9 Eliminating Linearly Constrained Parameters 
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where Q E [R-'^x^ ig an orthonormal matrix, and R G ig an upper triangular matrix. 

The norm remains invariant under application of Q or Q^, so 

P2-^ll2 = \\Q^ {QRx-b)f^ = \\Rx-Q^ hf^ (4.41) 

We assume here that R has full rank k.'^ The norm reaches its minimum when x is adjusted 
in such a way, that the k first components of R x — b vanish. Then only the last m 
components contribute to the norm. We decompose the matrix into an upper and a 
lower part 



Q 



T 



mxN 



and thus find*^ 



xl^{s,u)= Q^b 



Q^{u) {s - g{u)) 



(4.42) 



(4.43) 



Let us look again at the case of uncertainties in the pion mass. For simplicity, suppose 
there is only one lattice data point (x it Ax, y it Ay). The constraints are 



dmiy{x; u) 



dx 



" h{x,u) 

"/2(x,?i) = X 

dmiy{x; u) 



dx 



c/Ay\ 
1/AxJ 



with the abbreviations c 

A = 

Qii-factorization of A gives 

= with = , ^ ,„ + , ^ ,„ . 

VOy (Ay)2 (Ax)2 



(x — x) + m^[x\ uj 



y±Ay" 
X ± Ax " 



, y = rriNix; u), we read off 

^ ^ ^{cx-y + y)/Ay\ 
x/Ax J 



Q 



1 fc/Ay -1/Ax 
1/Ax c/Ay 



{Q\Q) 



(4.44) 



(4.45) 



(4.46) 



Q^b 



-1 e 
Ax Ay 



(cx-y + y)/Ay\ 
xj Ax J 



{y - vY 



c2(Ax)2 + {Ayf 



(4.47) 



The resulting expression for is in agreement with the result of the intuitive approach 
eq. (|4.36|) : The error in x effectively enlarges the error in y to ^Jc^{Ax)'^ + {^vY- It is 
easy to generalize the result to several lattice data points: 



Xeff 



E 



{mN{xu u) - yi 



dxi 



Axi) +iAy,Y 



(4.48) 



We can directly use this simple formula in our computer program. 



^If it did not, the minimum of would not be unique. This would be a badly formulated problem. 
^The following equation gives a prescription that is numerically both stable and fast. 
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4.10 Quadratic Approximation of x^? Error Matrix 

The crucial quantity for the determination of confidence regions and intervals is x^isip) = 
X^{s,t{p)), see eq. (|4.21|) . For many purposes, it is sufficient to perform a leading order 
expansion of X^{§.\p) i^i terms of p around its minimum Xopt P^^^ resulting in a quadratic 
form: 



^ dpi dpi ^ + + 2^^' ^^-^^^ 

We take a look at the hyper-ellipsoid 

H{T) = j/*^* + 5p ^6p^ B6p<T^ (4.50) 

For a threshold T = Xcl n' ^^^^ hyper-ellipsoid is the joint confidence region of all param- 
eters. Consider now a parameter dependent quantity S IR which can be calculated from 
the parameters using the function h{p). In section UTTI we saw that its confidence interval 
is h{ H{xcL i) )• methods to determine the image of H{T) under h are needed. For 
example, we could directly determine the upper bound Umax of the interval by maximizing 
h with the constraint ^6pi^ B^^i 6pi < T. Numerically, this is an involved task. It is more 
convenient to employ the reparametrization 

5pk{q) = V2T B-^l^ ^ sin H^H^ (4.51) 

where B~^/'^ is the symmetric matrix square root of the inverse of B. Then p'^^* -|- 5p{q) 
lies within the hyper-ellipsoid automatically, and an unconstrained maximization 

Wmax = max hip"""^ + 6p{q)) (4.52) 

q 

does the job. If h{p) is approximately linear within the confidence region, 

h{p) ^ hif""^) + 6p (4.53) 

we have an even faster method, h will take on its maximum value on some point of the 
boundary of the hyper-ellipsoid, and there its gradient with respect to p must be parallel 
to that of X^dlp)) so for some constant a 

- ^ B 6p = av =^ 6p = aB'^v (4.54) 
op - - 

a is determined so as to place 6p on the boundary of the ellipsoid: 



Tl^-5p^B5p = ^v^B-'v « = ^^f|^ (4.55) 

Substituting this into the expansion eq. (|4.53j) . we get 



h{p'''^) + aiF B-^ v = hip""'^) + ^/2 T v^B-^ v (4.56) 



4.11 Connection to Error Propagation 
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Because it is such a useful quantity, one introduces the error matrix E = 2 T B ^. 
Then we may write 

' ' (4.57) 



u 



The error matrix encodes the uncertainty of any parameter dependent quantity. For ex- 
ample, the uncertainties of the individual parameters pi are found directly on the diagonal 
of the error matrix: 

^ (4.58) 



Pi = pf^ ± 



In order to determine the uncertainty of a single scalar quantity, irrespective of other 
quantities, the threshold T is set to Xcl i- ^ confidence level CL = 68%, corresponding 
to one standard deviation, one finds Xcl 



4.11 Connection to Error Propagation 

The representation (|4.57|) with T = Xql i ~ allows us to make the connection to the 
standard method of error propagation, according to which 

(4.59) 

Here we have made use of As^ = 1- Einstein's summation convention is implied for all 
indices except for those explicitly summed over, p^^^ always minimizes so we 

determine its derivative by shifting the data s a little and analyzing the necessary shift 6p 
in p to stay in the minimum: 

dy'^is + 6s\p^^^ + 5p) d'^y'^islp^^^) d'^Y'^(s\p^^) 

= ^ ^ = l^i- ^ 6pi + ' 5su + ... (4.60) 

dpi opidpi opidsk 

Calculating B explicitly gives 

'dtk{p''')dtk{p''') , w est^^5'^fe(P"*)^ Mp""'') dtk{p'''') 



dpi dpm 

(4.62) 

if we can neglect the contribution proportional to the second derivatives of t (p) . Obviously 
this is always the case when the fit is very good. Then 

{Auf = V, {B-%^^^^^^^^{B-'),^ V, « V, {B-% ^Bi^ {B-'),m v, (4.63) 
and finally, with the definition of the error matrix for T = 1 

{Auf ^Vi^{B-^)jiVj=v^ Ev (4.64) 

Thus we have shown that error propagation applied to the maximum likelihood estimate 
is identical to error analysis using a threshold, as long as x^{^\p) is approximately 
quadratic in p and the approximation (|4.62() is valid. 
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4.12 Error Bands and Interpretation 

A central task of this thesis is the visualization of the uncertainties in our fit curve. Instead 
of just plotting the optimal fit curve, we display a band. There are two different formal 
concepts of constructing such a band, leading to bands of different width and requiring 
different interpretation: 

• One concept is to determine a confidence region R at a given confidence level and 
then define the band as the set of curves 

jfir : IR ^ IR g{x) = mN{x;p) , (4.65) 

Plotting all these curves, we sweep over an area consisting of the set of points 

%obal = \^ix,mNix;p)) p£ R^ (4.66) 

(The appropriate range for x will be subject of later discussions). This error band 
covers all curves that belong to a parameter set in the multi-parameter confidence 
region. All curves contributing to this global error band are likely candidates for 
the true function. 

• The other concept is to go first to a fixed pion mass x. Here, the nucleon mass 
is a function of the parameters, hx{p) = mN{x;p). One may now calculate the 
corresponding confidence interval R[hx], using the formalism discussed in section 
14.71 Assembling the confidence intervals at various values of x, we obtain a band 

!3ioce.i^{{x,y)\y€R[K]} (4.67) 

In contrast to the global error band, the width of this local error band Biocai at a 
given X directly corresponds to the statistical uncertainty about m^v at that point. 
However, the local error band should not be interpreted as the graph of a family of 
functions, because it does not cover all curves within the confidence region. 

In practise, ^global and i3iocai can be calculated using the same algorithm. There is but one 
difference: The x^-threshold XcL n ■Sgiobai is larger than the than the x^-threshold 

XcL 1 of 'Siocai- Consequently, the local error band is more narrow. 

Throughout this diploma thesis, the global error band iSgiobal is shown. It provides an 
answer to the question "Where might the true interpolation curve lie?". 



4.13 Uncertainties of Fixed Parameters 

It is not possible to extrapolate the pion mass dependence of the nucleon mass using a fit 
to the corresponding lattice data only. More information is needed. The literature offers 
a number of estimates for the LECs, obtained from the analysis of empirical data or other 
lattice calculations. At the present stage of research, it is sensible to fix parameters pi in 
our fit formula m7v( 2;; p) to these "empirical" values wherever reliable data is available. The 
interpolation function then takes the form mN{x;p,b), where b are the m fixed parameters 
and p are the n parameters that remain to be fitted. 
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The external estimates for the LECs bi have uncertainties, given in the form of intervals 
= [bi,min, bi,max}- We havc no information about the correlation of these uncertainties. 
We cannot assume that two different fixed parameter values bi and bj are samples of 
independent statistical processes. A conservative treatment of error propagation in this 
case is to consider all possible choices of the bi within their error intervals, and then 
to take the union of the resulting confidence regions. Thus for each choice 6 G C/;, = 
f^6i X X ■ ■ ■ xUb^ we calculate the confidence region Rb of the parameters p using the 
fit function mN{x;p,b). Prom these regions, we form the envelope 



This envelope is our conservative estimate of the uncertainties of p. Analogously, the error 
band is now an envelope of error bands at specific choices of b. 

4.14 Implementation of Statistical Error Analysis 

Two different algorithms have been implemented to produce parameter uncertainties and 
error bands: 

• a Monte Carlo exploration of the parameter space 

• a parabolic approximation in Mathematica 

The Monte Carlo method has the ability to handle deformed confidence regions, while the 
parabolic approximation always produces hyperellipsoids. However, in the final form of 
the fit problem, this approximation turns out to be sufficient. 

4.14.1 Hybrid MINUIT and Monte Carlo Algorithm 

The idea of the Monte Carlo search is simple. First, a bounding box is determined for the 
confidence region, i.e., minimal and maximal values for each parameter pi are calculated. 
Then, the parameters arc chosen randomly inside the bounding box. If the resulting 
is larger than an appropriately chosen threshold, this choice of parameters is discarded. 
Otherwise, the interpolation function is calculated, and the upper and lower bounds of 
the error band are modified wherever necessary to accommodate the new representant. 

The implemented program makes use of the CERN ROOT library, which includes a version 
of the MINUIT minimization routines. The program has an option to perform the band 
calculation using the error matrix provided by MINUIT instead of a Monte Carlo search. 
To determine upper and lower limits of the band, the function mN{x;p,b) is evaluated on 
a grid of pion masses ^i, ...,^nres- The choice of this grid only affects the extent and the 
resolution of the plot. Errors in the pion mass are neglected, so that is of the form 




(4.68) 



b&Ub 




(4.69) 



Now the algorithm looks as follows 
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• Select initial values for the fixed parameters: b <— ^start- 

• Minimize using MINUIT's MIGRAD command. 
This returns a set of optimal parameters p^^^. 

• Initialize the boundary variables of the band: 
'Smin(?fc) mNiCk]lf''^,k) VA: = O..nros 

• loop until user aborts or desired number of iterations is reached 

— Scan the fixed parameter range Rh on a uniform grid. For each b 

* perform a Monte Carlo search of 200000 steps or 

* minimize and perform a band calculation using the error matrix provided 
by MINUIT, applying eq. (|I37)) 

(In all steps, the variables for Smin('^fc), SmaxiCk) are only overwritten when the 
extent of the band becomes larger.) 



On each call to a Monte Carlo search 



• Find a bounding box Up for the parameters p fulfilling 

x^{y\p) < Xopt(y) + XcL,n g Up 

• for each Monte Carlo step, do 

— choose p randomly from within the bounding box Up 

— if X^{y\p) > Xopt(y) + XcL,n ' skip the rest of the loop 

— for A; = 1, .., nros do 

* if BminiCk) > mN{£,k;p,b), store iSminfe) ^ rnNiCk;P,k) 

* if i3max(&) < mN{Ck;P,b), store ^max(&) ^ 'rnN{^k;P,b) 

The bounding box can be optimally determined using MINUIT's mnerrs-command, ap- 
plied once for each parameter pi. Staying always in the minimum of x^ivlp) with respect 
to all other parameters pi, ...,pi-i,pi+i, ■■■,Pn, this procedure searches the largest interval 
] fulfilling x^{y\p) < Xopt(y) + XcLn- "^^^ bounding box in question is then 

— [Pl,!!!!!!! Pljimax] X • • • X [p7i,mirn Prtjinax] 

The principle of the algorithm is illustrated in fig. 14.71 A simple minimization yields 
^est^ with which one can plot the best fit curve (blue). The MINUIT mnerrs-command 
explores the parameter space, finding the bounding box that encloses the confidence region 
(zigzag paths terminated by crosses on the box). Then a random search selects points (red 
dots) within the bounding box. They are discarded if they exceed the x^-limit (light red 
dots), otherwise they lie within the confidence region (shaded blue). The corresponding 
interpolant is calculated and contributes to the band (red curve). 

Note on performance: In the ideal case, with no correlations and depending quadrat- 
ically on the parameters, the relative volume taken up by the confidence region in the 
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bounding box corresponds to that of a hyper-sphere in a hyper-cube. In n dimensions, 
the volume of a unit sphere divided by the volume of its bounding box is 



nr(f), 



1 



(4.70) 



which decreases rapidly with growing n. This means that the number of hits inside the 
confidence region decreases rapidly with a growing number of parameters. However, a 
certain number of hits within the confidence region is required to obtain a sufficiently 
accurate representation of the confidence region. Thus the algorithm becomes very ineffi- 
cient for large n. Experiments carried out with n = 6 still performed well enough. Yet, in 
order to apply the algorithm in a more challenging situation, the uniform distribution of 
trial points within the box would have to be replaced by something smarter. Metropolis- 
like exploration of the parameter space could be fruitful. Additionally, this would work 
without the help of a traditional minimizer like MINUIT, and one might be able to deal 
with non-Gaussian probability distributions this way. 



P2^ 



mjx;p) /\ 













i ^^^^^ 

























Figure 4.7: Hybrid MINUIT and Monte Carlo Method. 



4.14.2 Quadratic Approximation in Mathematica 

Mathematica offers greater flexibility and a friendlier interface for rapid development. 
Therefore, the statistical analysis has also been implemented in Mathematica, in 
quadratic approximation. Instead of MINUIT, Mathematica's native minimization rou- 
tines FindMinimum and NMinimize are used. The matrix B of eq. 1)4. 49() is determined by 
differentiating twice, using Mathematica's numerical differentiation. Upper and lower 
limits of the error band and the parameter confidence intervals are determined using the 
reparametrization eq. 1)4. 51(1 and 1)4.52(1 . or, as a cross check, using the error matrix as in 
eq. (|4.57|) . Evaluating 



x^(y|p) -Xopt(y) 



1 (4.71) 



at random points p on the boundary of the confidence region gives an estimate of the 
relative error in the uncertainty caused by non-quadratic behavior of x^- Uncertainties in 
fixed parameters b are treated in the same way as in the Monte Carlo approach: The box 
Ub is scanned on a uniform grid (Typically, it the constraints fi{p,b) behave monotonously 
in b, and it is sufficient to include the corners). 
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In our nucleon mass fit, errors in the pion mass are treated using an effective ^ as in 
eq. ()4.48|) . The requirement for setting up this effective is the approximate linearity 
of m]\f{x;p,b) for x ^ Xi + 0{Axi). If this holds, we may replace the derivative by a 
finite difference with a gap size of the order of Axj. This is numerically advantageous. 
Altogether, the employed here is 

^^iy\p) = f; [m-m^{x,,p,b)]' 

i=i [rriNixi + |Axi; p,b) - niNixi - ^Axf, p,b)] + [Ay^]^ 

The multi-parameter x^-increment employed is XcLn' corresponding to the number n of 
fitted parameters p. 



4.15 Note on Conventions 

Throughout this work, we stick to a confidence level CL = 68%, unless specified other- 
wise. Errors of fit parameters are specified in terms of single parameter error bounds, 
using XcL 1 ~ This complies with the standard conventions in the literature, and corre- 
sponds to the errors given by MINUIT in the default mode. Note that the multi parameter 
confidence region is not covered entirely by the bounding box from single parameter errors 
|Jamj . 
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5.1 General Remarks 

Previous sections have discussed at length how uncertainties in the input data propagate 
into uncertainties about our fit parameters and the interpolation curve. Yet, this statistical 
error analysis always forced us to make a central assumption: that the theory describes the 
data correctly and accurately. We know, however, that this is not the case. For one thing, 
the data may be subject to systematic distortions. Furthermore, the fit function from 
theory is just a series expansion truncated at some order of small momenta p. Even if the 
expansion coefficients (LECs) were known precisely, our fit function deviates from truth 
significantly once the expansion variables reach a certain magnitude. These deviations are 
often referred to as higher order effects. Our error analysis must therefore work in two 
steps: 

1. Statistical error analysis: Make an assumption on the range of applicability of 
the theory. Assume that the theory is exact in this range, i.e. ignore theoretical 
uncertainties^. Pick data from the range of applicability. Fit theory parameters 
to data in that range. Using the formalism presented in chapter HI determine the 
uncertainty of theoretic parameters (here the LECs) resulting from the uncertainty 
in input data (here e.g. the lattice data points). 

2. Theory error analysis: Verify that the assumption of a negligible/tolerable theory 
error in step ^ is justified. If it is, the uncertainties determined in step ^ can be 
accepted as the final result of our analysis. If not, try to estimate an improved range 
of applicability. For an effective theory this means: Make an intelligent guess about 
the magnitude of the deviation of the fit function from truth, which results from 
the fact that the fit function is a truncated series expansion. To make this guess, 
the approximate size of the LECs is needed. There is no other choice but to make 
use of the results of step ^ Find the upper limits of the expansion variables where 
theoretic errors are expected to be acceptable, and go back to task^ 



^Including theoretical uncertainties in the statistical analysis is difficult. It has been attempted in the 
course of this diploma thesis, but the results were not convincing, perhaps in part due to a lack of enough 
redundance in the input data. 
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Despite tlie interplay of the two steps, theoretical and statistical error estimation require 
entirely different methods. It is dangerous to confuse the two different tasks, see section 
16.2.51 Ways of estimating higher order effects will be illustrated in situ. In general, it is a 
"dirty" task, lacking mathematical rigor. 



5.2 Convergence Properties 

Heuristic arguments show that perturbative QCD yields asymptotic series |WesOO) . 
To understand what an asymptotic series is, consider a Taylor series expansion Sn{z) = 
Y17=o '^'i--^^ some function f{z). Sn{z) is said to form an asymptotic series if 

for fixed z : lim ^ fiz), (5-1) 

n— >oo 

but for fixed n : 3C„: - /(z)| < C„z"+^ Vz (5.2) 

In most cases, the series is divergent, i.e. lim„^oo Sn{z) does not even exist. Even though 
Sn{z) does not converge towards the correct result for n ^ oo, it does yield good approx- 
imations for f{z) at some order n of the series, if only z is sufficiently small. At fixed z, 
there is some order n = ?ibest(-2) where the series gives its best approximation to f{z). 
When calculating Sn{z) to higher and higher orders, results start becoming worse again 
beyond nbest(-2)- Typically, the lower z, the higher is nbest(-2)- For QCD matrix elements, 
the expansion parameter z is related to the (running) coupling constant a^. 

It is reasonable to suspect that yields asymptotic series as well. For our nucleon mass 
formula, the expansion parameter z is related to the pion mass m-^j h.^. In particular at 
larger pion masses, we must take into consideration the possibility that efforts to go to 
the next order are in vain, and turn out to worsen our result. 



Chapter 6 



Chiral Extrapolation of Nucleon 
Mass Lattice Data 



We turn to the numerical application of the machinery introduced in the previous chapters, 
fitting infrared regularized SU(2) B^PT nucleon mass expressions to unquenched two- 
fiavor lattice data. 



6.1 Survey of Available Lattice Data 



Our primary collection of two-flavor lattice data is that of ref. AK"'"n4) . listing fully 



dynamical simulation results from the QCDSF, UKQCD, CP-PACS and JLQCD collabo- 
ration. Secondly, we make use of data presented in ref. |()LSn5j . comprising recent results 
from SESAM and related projects. The selection of input data has to match the capabili- 
ties of the model function. In a later section we will explore the ability of xPT to correct 
for artifacts due to the finite lattice simulation volume. Therefore, we already show here 
data with noticeable shifts due to smaller lattice volumes. Table Ib!T] lists the two-flavor 
lattice data used in plots and fits, up to a pion mass rriT^ < 1.0 GeV. The horizontal lines 
separate "volume groups" of equal simulation parameters k, collaboration). For each 
group, the simulation on the largest lattice is marked "large L". The following selection 
criteria are supposed to keep spurious lattice artifacts at a tolerable level: 



lattice spacing a < 



0.15 fm for C'(a)-improved Fermions; i.e. data from AK"'"04| 



0.10 fm for unimproved Fermions; i.e. data from |()LS05j 



• simulation volume side length L > 1.0 fm , because, according to |OLS05j . finite 
volume corrections based on xPT seem to be applicable in this regime. 

• For each volume group, the simulation in the largest volume fulfills mT^L > 5. These 
points are marked "large L" and exhibit negligible finite volume effects. (The pion 
cloud, whose size is of the order of the Compton wave length l/mT^, must fit well 
into the simulation volume.) 



The criteria for the "large L" points for improved Fermions are those of ref. AK"'"04 
also employed in ref. |PHW04] . 
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For points 53 and 54, no calculation has been performed for the Sommer radius ro/a, which 
is needed to determine the lattice spacing a. In accordance with ref. |ULS05j . ro/a for 
these two points is copied from point 56, which features the same simulation parameters 
13, K. 



For the unimproved Fermions of ref. |()LSfl5j . we expect larger discretization errors at 
comparable lattice spacings a. Looking at fig. 16.11 the decision to place the cut for 
unimproved Fermions at 0.10 fm seems reasonable: Unimproved Fermion calculations for 
coarser lattices visibly deviate from the general trend of high quality data points. We have 
marked them "coarse" in table Ib!T1 



2 
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1.7 

a L6 

z 
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unimproved, 0. 1 fm < a < 0.15 fm 



0.6 



0.7 0. 

[GeV] 
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Figure 6.1: "Large L" {m-,^L > 5) lattice data from simulations with improved Fermions 
|AK"'"04| and and unimproved Wilson Fermions |ULS05j . 

The conversion of lattice data to physical units has been performed according to eq. 
13.321 Statistical errors in masses am and in the length scale rg/a have been propagated 
quadratically. 



6.2 Application to Large Volume Lattice Data 

A fit of the expression ^ as in eq. (|2.79|) to lattice data has already been shown in 
ref. pr'HW04] . This is to be supplemented with a thorough discussion of uncertainties. 



6.2.1 Higher Order Effects in the Pion Mass 

Actually, rn,^'^ ^ = m^*^ ^(m7r; mo, ffA, /tt, ci, C2, C3, Cj^^^ (A)) is a function of the leading 
order pion mass m,r = V^Bm - not the physical pion mass provided by lattice calcu- 
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lations. Our assumption here is that the difference is negligible. This assumption seems 
to be justified, because the formula 

ml ^ 2Bim (6.1) 

describes lattice calculations of the pion mass as a function of the quark mass very well, 
if Bi is fitted to the data, see fig. 16.2b . It appears self-evident that this means that that 
the GMOR relation is very accurate without further corrections at higher orders: 

ml^fnl = 2Brh (6.2) 

In other words, the slope Bi observed on the lattice "naturally" is identical to the pa- 
rameter B. We are going to accept this common presumption for our SU(2) fits. Yet, I 
would like to mention that this is still an open issue. Let us take a look at the pion mass 
expansion to order / |(;L84j : 

ml = ml + (2{47rfm + 1 In ^) + 0(/) = fnlK (6.3) 

where Rj^ can be expressed as 

R. = l + i^^J^2 1" § + '^^P') ' = [-4(4vr)2/5(A)] (6.4) 

The correction term vanishes in the chiral limit and at = h^. In between lies a 
minimum whose magnitude is proportional to A^. No matter how we choose A^, we 
get sizeable corrections for pion masses below 0.6 GeV. Note that lattice calculations 
have not reached pion masses much lower than 0.3 GeV. It is possible that deviations 
from eq. 1)6. If) set in there. Let us take, for example, the central value of the estimate 
I3 = — 4(47r)^/3(m7r) = 2.9 it 2.4 obtained from the study of meson and quark mass ratios 
in ref. |GL84j . As in ref. |Liis05j . we plot R-,^ over m^, using 0.0924 GeV, see fig 

16.2b . Since the value of B is determined from the fit, any functional behavior of R-j^ijri-,^) 
which is approximately constant for m-,^ > 0.3 GeV is compliant with lattice data. In 
other words, we can still fulfill eq. 16.11 in the range of available lattice data, if we make 
higher order corrections starting at order responsible for the formation of a plateau 
(figuratively displayed as the dashed line). Note however, that now B ^ Bi. Since we 
surely work in the plateau region of Rt^, we could now utilize 

_ B , , 

= —m^ (6.5) 
-t>i 

as input to our nucleon formula - if we only knew B/Bi. Just to get an impression of 
a possible magnitude of this ratio, let us make the assumption that the plateau lies at 
the level of the minimum of Rj^, as plotted in fig. 16.21 Then for I3 = 2.9 we obtain 
Bi/B = 0.95. An attempt to extract information from MILC SU(3) data in section [7| 
even suggests Bi/B ~ 0.9 in the relevant range, see the low lying brown line in fig. 16.2b . 

For the practical purposes in the following SU(2) nucleon mass fits, we have ignored the 
concerns about higher order effects in the pion mass, and have made no difference between 
and fn.,^. 
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Figure 6.2: a) Taken over from ref. |Lus05j : SU(2) lattice results for the pion mass plotted 
versus the current quark mass m in lattice units. The dotted line is a linear fit of all four 
data points. Below niT^ = 500 MeV, one-loop chiral perturbation theory at order fits 
the data equally well (full line). The point represented by an open square is obtained 
by extrapolation to the physical pion mass. b) p'^ correction factor R.,^ to GMOR. The 
thick solid line depicts the central value of the result of ref. |(TL84j I3 = 2.9 ± 2.4. The 
low lying brown line corresponds to our analysis in chapter [7| finding = 3.9, see also fig. 
17.31 The dashed line figuratively suggests that effects beyond the approximation might 
level out the correction factor i?^, which is what lattice data indicates. 



6.2.2 Setting up the Fit 

Without loss of generality, we set the regularization scale to A = 1 GeV. Restricting 
ourselves to "large L" lattice data with the smallest available pion masses we pick points 
no. 19, 41, 8 and 23 from table IbTTI ^ In order to avoid an under-determined fit, we must 
fix some parameters. We eliminate ci by substituting the experimental values of pion and 
nucleon mass 

into formula (|2.79|) : 

m''N^ \mP^y'';mo,gA, /°, ci, C2, C3, e^''''(A)) = mp^'' ci{mo,gA, /°, C2, C3, e^"^^) 

The LEG C2 is set to 3.2GeV~^ as determined from pion-nucleon scattering in ref. 
|FMS98j . and C3 is fixed at — 3.4GeV~^, according to the NN phase shift analysis in 
ref. |EM02j . Particularly the uncertainty about C3 is substantial, but difficult to quantify. 
For the moment, we ignore uncertainties in C2 and C3, and defer the discussion to section 
16.31 As a first approximation, we set and equal to their values at the physical point 

9a « = 1-267, /° ^ /f = 92.4 MeV (6.6) 

Actually, the chiral limit values g^ and /° are expected to differ slightly from the values 
at the physical point. We can safely assume to find g\ and in the intervals 

g^A = 1-1 •• 1-3, /° = 86.2 MeV .. (92.4 MeV = /P^y'') (6.7) 



^Our choice to select data with < 0.6 GeV was initially motivated by a recommendation in ref. 
|BHM04) not to go beyond pion masses larger than 0.6 . . . 0.7 GeV. 
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The range for (7^ is taken from |HPWn3] . For the lower boundary of we take the lowest 
estimate we found in the literature: a numerical analysis of the pion mass dependence of 
/tt in ref. |Goc04j . Eventually, only two parameters remain free: mo and e[^^(A = 1 GeV). 

In summary, we have fixed parameters c = {g\, C2, C3) and fit parameters p = (mo, e[^^). 



6.2.3 The Calculations 

Originally, calculations were performed using the Monte Carlo algorithm described in 
section 14.14.11 Errors in the pion mass were neglected. g\ and were varied on a 
uniform 11 x 11 grid spanning the range ()6.7() . The resulting error bands and confidence 
regions were combined to what we termed a systematic envelope^'. As an example, 
for the band calculation of the CL = 68% systematic envelope, the number of parameter 
trials was in total 2 X 10^. Thereof 2 x lO'^ were in the region of confidence. The band grew 
in some direction 1023 times. For g^j^ and fixed at the physical values, the fit results 
of this setup are identical to those of Fit II in ref. |PHW04] . They are listed again for 
convenience in table 16.11 ^ Systematic envelopes of error bands are shown for CL = 68% 
and CL = 95% in fig. 16.3b . In the plots, we choose as our abscissa, because it is 
approximately proportional to the quark mass rfi. 

Later, the quadratic approximation described in section 11.14.21 was employed, because of 
Mathematica's greater flexibility. Random inspections using expression (|4.71j) show that 
the extent of the confidence region does not deviate from the hyperellipsoidal approxima- 
tion by much more than 5% in any direction. The lattice error in the pion mass is now 
taken into account, but the effect is hardly noticeable. Fixing g^j^ and at their physical 
values yields the "statistical error band" of fig. 16.3b . at a confidence level CL = 68%. The 
systematic envelope was calculated at the same confidence level, letting g'^ and assume 
values on a uniform 5x5 grid. The confidence intervals of the parameters are listed in 
table 10 

Fig. 16.31 shows the resulting error bands calculated with the two different algorithms. 
Table 6.1: Fit II as in |PHW04] : parameter values and observables 





(GeV-3) 


2.9 ± 0.7 


fitted 


rtiQ 


(GeV) 


0.8820 ± 0.0028 


fitted 


Cl 


(GeV-i) 


-0.927 ± 0.037 


elim. 


9a 




1.267 


fixed 


fO 


(GeV) 


0.0924 


fixed 




(GeV-i) 


3.2 


fixed 


C3 


(GeV-i) 


-3.4 


fixed 


xVd.o.f. 


0.15 


(TiV 


(GeV) 


0.0491 ± 0.0028 



^The tables also list resulting values of other observables T""", and ajv which we will discuss later. 
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Figure 6.3: Global statistical error band for a fit to large volume lattice data with 
< 0.6 GeV. The physical point is included, and we fixed C2 = 3.2GeV~^, C3 = 
— 3.4GeV~^. The grey lattice data points have not been used as input. They are only 
shown for comparison, a) Fit II: systematic envelope from the Monte Carlo algorithm at 
confidence levels CL = 68% and CL = 95%, pion mass errors ignored b) Fit lib: error 
analysis in quadratic approximation at CL = 68%, pion mass errors included. 
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Table 6.2: Fit lib: parameter values and observables. Pion mass errors have been taken 
into account. 





(a) statistical error 


(b) systematic envelope 


eS^^(lGeV) 


(GeV- 




3.0 ± 0.7 


fitted 


1.0 


.. 4.6 


fitted 


mo 


(GeV) 




0.8820 ± 0.0030 


fitted 


0.876 


.. 0.888 


fitted 


Cl 


(GeV- 




-0.927 ± 0.039 


elim. 


-1.04 


.. -0.82 


elim. 








1.267 


fixed 


1.10 


.. 1.30 


scanned 




(GeV) 




0.0924 


fixed 


0.0862 


.. 0.0924 


scanned 




(GeV- 




3.2 


fixed 


3.2 


fixed 


C3 


(GeV" 


1 \ 


-3.4 


fixed 




3.4 


fixed 


X /a.o.t. 






0.13 




0.1268 


.. 0.1346 






(GeV- 




7.22 ± 0.35 




6.3 


.. 9.6 




p+(<3) 


(GeV- 


') 


598.69 ± 0.45 




570. 


.. 770. 






(GeV) 




0.0490 ± 0.0029 




0.0446 


.. 0.0537 




f'+{<3) 


(GeV- 


') 


2.67 ± 0.35 




1.8 


.. 4.3 




p+(<3) 


(GeV- 




837.63 ± 0.45 




810. 


.. 1050. 




ef^(lGeV) 


(GeV- 


') 


1.83 


fitted 



6.2.4 Discussion of the Result 

In the limit 0, the error band remains narrow, reflecting our ability to determine rriQ 

accurately. (Of course, this prediction relies heavily on the inclusion of the physical point, 
which is rather close to the chiral limit.) For pion masses below 0.6 GeV, i.e. left of the 
four selected lattice points, the band does not bulge much. As long as information about 
the physical point and the LECs is included, chiral perturbation theory provides a stable 
interpolant which is conditioned well and can make predictions within its range of validity. 
The agreement between best-fit curve and lattice data up to m-j^ ~ 750 MeV, close to the 
typical chiral symmetry breaking scale A-^, has to be considered accidental with present 
information in input. The diverging statistical band indicates that the predictive power 
of our analysis about the shape of the curve in the region ttItt > 600 MeV is low: the 
information we have in input (in the form of lattice data and parameters), constrains the 
shape of the interpolating functions weakly at pion masses larger than 600 MeV. This 
coincides with our expectation that higher order effects become intolerable at larger pion 
masses. 

Table IHTTl gives a summary of parameters and single parameter error bounds. Note that 
the individual fit parameters depend heavily on the choice of and resulting in large 
"systematic envelopes" in table column (c). The same phenomenon can be observed 
visually in plots of the confidence region, see fig. 16.41 Note that the statistical correlation 
between e[^^ and niQ is considerable, but still allows us to perform a stable fit. In contrast, 
a parametrization in terms of mo and the pion nucleon sigma term ajsf would not have 
been possible - here the confidence ellipse degenerates to a line, see fig. 16.41 1). This is due 
to the fact that e^^^ is essential for the description of the curve in the range where we have 
data. 
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Figure 6.4: CL = 68% 2-parameter confidence region for Fit lib. Light blue oval with 
thick line: confidence region for g^^ = g^^^^ , = fn^^''- Green regions: confidence regions 
for a scan of and Together they form the systematic envelope, bordered by the 
outer line. The confidence region for the fit parameters a) has been translated in b), c), 
d) to show the statistical correlation between other quantities of interest. 



6.2.5 Higher Order Effects in and 

Like the nucleon mass, the axial-vector coupling constant gA and the pion decay constant 
/,r can be expanded in terms of quark masses, or more conveniently rriTr- They differ from 
their chiral limit values by correction terms starting at order m^, see e.g. ref :BKM 95] : 



5a = 9A{rn^){l + 0{ml)) , = /.(mO(l + 0{ml)) 



(6.8) 



When substituting these expressions into m^^^ \ the corrections appear at order m^. 

Using the full axial- vector coupling gA{^n) and/or the full pion decay constant fnijnn] 
as input to the nucleon mass formula 



m^^'-'(m,r, 9A("i7r), /ttC^Itt), cl, c2, c3, e^i^*^ ) 



(4). 
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vr 



(1) (2) (3) (4) 

Figure 6.5: Corrections entering the pion loop contribution to the nucleon self energy. 
The pion in diagram (1) couples to the axial-vector current of the nucleon. (2)-(4): Some 
corrections to the interaction vertex can be explored by probing the nucleon with an 
external axial- vector field (wiggly line). These corrections are subsumed in g^. 



actually amounts to the inclusion of some graphs of higher order. In particular, it modifies 
the description of the coupling of the pion to the nucleon in diagram 12.21 (a) which occurs 
at order p^. The pion couples to the axial vector current of the nucleon, whose strength 
is parametrized by qa- Thus corrections to will improve the description of the vertex, 
as illustrated in fig. 16.51 

In this context, the findings in ref. |MB99j should be mentioned: A two-loop heavy baryon 
calculation shows that the contributions to the nucleon mass at order and p^ are 

The term of order is already present in our expression m^^'^\ because it is a recoil 
correction automatically obtained from a relativistic calculation. All other contributions at 
order p^ arise naturally through the use of the full pion-nucleon-nucleon coupling constant 
9iTNNiiTT"K), the full pion mass m,r and the full nucleon mass m^r in the term proportional 
to m^. There is a close relationship of g^NN to gA and /jr, the difference being called the 
Goldberger-Treiman discrepancy 

A.;v;v - 1 - = 0{ml) (6.10) 

^■kNN is known to be tiny at the physical point (~ 4% |BKM95j ^. but we do not have 
information about its evolution with the pion mass yet. It is the Goldberger-Treiman 
discrepancy which is responsable for the fact that use of gA and /vr instead of their chiral 
limit values does not suffice to promote our nucleon mass formula to order p^ . 

All in all, the use gAim-w) and/or f^imT^) at least amounts to a partial evaluation of 
higher order diagrams, giving contributions starting at order m^. Thus, as mentioned in 
ref. |Bean4j . evaluating 

6mf^ ^ mr ^(-., gA{m.l U{m.U„c„c„ e^^) 

- 'm'N'^'\mT,,g%fl,ci,C2,C'i,e{^) (6.11) 

can serve as a measure of the size of some higher order contributions starting at order m^. 
Wherever these contributions are large, we must conclude that the next order correction is 
likely to be significant, and that the theoretical uncertainty should be estimated as high. 

In an attempt to perform such a study on higher order contributions, ref. |Bean4j performs 
the following steps: Unknown parameters p are fitted to the data using g^^^^ and f^^^^- 
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Next, confidence intervals for g\ and similar to ours in eq. 16.71 are estimated. Now 
iT^N i^n, 9 fn^P) is plotted, keeping the fit parameters p constant and varying and 
within their confidence intervals. The result is a huge band, giving the impression 
that higher order effects are intolerably large. However, to my present understanding, the 
author of ref. |Beafl4j has confused statistical and theoretical error analysis. The produced 
band does neither possess the claimed meaning as an estimate of higher order effects, nor 
does it correctly reflect our uncertainties in the fitting procedure. 

Varying g^ and within their confidence intervals is part of the statistical analysis. 
Here, the impact of incomplete knowledge of input parameters on our output function is 
assessed. In order to treat correlations correctly, it is necessary to refit the parameters 
p for each choice of g^ and Fig. 16.61 shows fits for two different extreme choices 
of g^ and f^. Obviously, the shape of the interpolant remains largely unaffected. The 
impact of uncertainties about and also manifests itself as the difference between our 
"systematic envelope" and our "statistical error band" in fig. l6.l-{[ which is quite small. 
Obviously, the effect of uncertainties in g^ and is far less dramatic than pointed out in 
|Bear)4) . Note that the magnitude of higher order corrections does in no way depend on 
our uncertainties in the input! 
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Figure 6.6: Fits at two different extreme choices of g^ and 

On the other hand, discussing higher order effects requires the knowledge of the functions 
dAi^n) and f-nim-w) over the whole range of rriT^. Given these functions one may apply eq. 
16.111 Of course, the procedure is consistent only if g^j^ = gAifn-w = 0) and = fnirnj^ = 0). 
The quark mass dependence of /^r is still not very well known. Recent attempts to fit to 
lattice data do not perform well in the region where lattice data is available, see e.g. 
ref. |Goc04j . For gA, however, we may take the best fit curve with eq. (12) in ref. 
|HPWfl3] . The formula therein has been calculated in non-relativistic SSE to order e^. As 
usual, we fit e^^^ and tuq using our formula rn^^'^ \mT^, gA{0), ...), fixing = 0.0924 GeV, 



6.2 Application to Large Volume Lattice Data 



69 



C2 = 3.2GeV~^, C3 = — 3.4GeV~'^. Next to this best fit, we plot 'm^^'^\'mTr, gAifn-w), ■■■)■ 
The difference between the two plots is not severe, meaning that higher order effects from 
diagrams related to gA should not play much of a role. Again, we contradict the statement 
made in |Bea04j . 
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Figure 6.7: "Higher order effects" made visible by replacing —>■ QAi'm-n)- 



6.2.6 Convergence 

The straightforward way of assessing convergence properties are plots at different orders 
in the perturbation expansion. For this purpose, we take parameters from our fit with the 
best fit function available to us: mj^^ ^ from formula ()2.79|) . Then we plot curves using 
these same parameters at lower orders, i.e., nv^ from formula 1)2. 78() and rrv^ from 
formula (|2.77j) . As mentioned in section [2.121 matters are complicated by the role oiC^^j^ at 
different orders, e^^^ (A) and ef^ (A) are actually not the same coupling constants. However, 
the functional form of m!"^^ does not permit us to discriminate between the contribution 
from e\ (A) and the additional contributions entering at order p^. A naive approach to set 
e{\\) = e[^^(A) is A-dependent, because the running of the two coupling constants differs: 
Requiring scale independence of the individual formulae m^'^^ and \ we obtain 

eS^)(A) - )(Ao) = f ^ - 8c, + C2 + 4c,] In A (6.13) 



32^2(^0)2 y . . z , 

The reason for the different running behavior are two more loop diagrams entering only 
in a genuine calculation, see fig. 12.21 (b,c). 
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At present, there is no way to figure out numericahy which part of (A) belongs to 
e^'^^(A). This thwarts our intention to import all parameters from the p^-fit to p^. The 
best we can do is to determine (A) from a secondary one-parameter p^-fit to the data. 
Of course, this compromises the stringency of our convergence test. 

For the plot in figure the central values from the fit at order have been substituted 
into the formulae at order p'^, and p^, leaving ep^(A) open. From a one-parameter fit 
of the function to lattice data we obtain ef^(lGeV) = 1.38 GeV-^. Figure Ol conveys 
the impression that convergence is formidable in the whole area m-,^ < 0.6 GeV. Here, 
the p^ and curve run very closely together. In the shaded area > 0.6 GeV the two 
curves separate quickly. At such large pion masses, the curve can no longer be treated 
as a small correction to the p^ curve. It should be admitted, that this outcome of our 

(3) 

convergence analysis is strongly influenced by the fit of e\ (A) to the data. Certainly, the 
intersection of the p^ and curves in the midst of our data points is not an accident. 

Therefore it is interesting to study a modification of our assumption on the range of 
applicability of the theory. Taking input from lattice data of pion masses up to 0.8 GeV, 
we obtain the results shown in fig. 16.91 The error band at order p'^ becomes more narrow 
due to the increased statistics. Again, the p^ curve intersects the p^ curve in the center 
of the lattice data range - this time at a larger pion mass. Thus our convergence analysis 
strongly depends on the placement of our input data. This is undesirable, yet it does not 
prevent us from drawing conclusions: The overall convergence inside our assumed range 
of applicability < 0.8 GeV is visibly worse than it was for the < 0.6 GeV fit, and 

(3) 

that even though e\ (A) was chosen in the most favorable way. 
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Figure 6.8: Study of convergence: nucleon mass predictions from formulae at order p'^, 
p^ and p^, using the parameters from the p^ fit. At order p^, e^^^(A) cannot be imported 
from and needs to be determined from a secondary fit to the data. 
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Figure 6.9: Study of convergence for a fit to data of pion masses up to 0.8 GeV. 



6.2.7 Ab-initio Extrapolations? 



So far we needed to fix some LECs to values from previous empirical studies. This is 
not entirely satisfactory. What about our initial goal to make physical predictions from 
pure extrapolations of lattice data? Surely, such an "ab-initio" calculation would require 
lattice data points at lower pion masses. But how low? In order to address this question, 
we extend our input lattice data by placing "fake" data points on the best fit curve we 
already have. Then, band calculations are performed - this time with less fixed parameters 
and without inclusion of the physical point. This way we can explore how future lattice 
calculations at lower pion masses will improve our ability to make predictions. 

We begin our analysis by plotting Fit II from table 16.11 On the fit curve, we calculate 
the points at rn-Tr = 0.3 GeV and = 0.4 GeV. We append them to our set of input 
lattice data points, giving them uncertainties of ±0.01 GeV. Next, a band calculation is 
performed on the extended input data set. This time, the physical point is not included. 
In our study, we keep g\ k. g^^^ and « fw^^'^ fixed, but we release all other LECs. It 
should be noted that the LECs parametrize the interpolation curve with some redundancy. 
As in ref. pPIIW04j . we eliminate this redundancy by introducing linear combinations 

3C2 



1287r2(/; 



0\2 



B = C2 + 4C3 



(6.14) 



Thus the variables p of the error analysis are mo, ci, A and B. The result is the green 
outer band of fig. 16.101 Introducing one more "fake" data point at m-,^ = 0.2 GeV narrows 
down the band to the area shaded red. 

The result of this and similar studies is clear: Within the forseeable future, an ab-initio 
extrapolation of lattice data down to the physical point using our fit function ni'^ will 
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Figure 6.10: Ab-initio band calculations for fits to fake lattice data input at low rriTr. 

not be possible. The problem is badly conditioned. Actually, this is not so surprising. The 
fit function is similar to a polynomial. Polynomial extrapolations are known to have bad 
extrapolation properties. 

Is this a reason to give up chiral extrapolations of lattice data altogether? No, it is not. 
First of all, we can study and fit multiple observables with the same set of parameters 
simultaneously, thereby greatly improving the condition of the extrapolation problem. An 
example of such a simultaneous fit is given in section 17.11 Secondly, chiral perturbation 
theory can describe the effects of other lattice simulation parameters, like the size of the 
simulation volume. In section E31 we demonstrate that this way valuable additional input 
becomes accessible. Last but not least, we can incorporate and check compatibility with 
empirical information. We give an example of this possibility in the next section. 



6.3 Consistency with Empirical Constraints 

If results from lattice extrapolations are physically meaningful, they should be compatible 
with empirical results for observables whose chiral expansions involve the same parameters. 
Here we study a collection of such quantities: 

• T+(m-7r), the isospin-even ttN S-wave scattering amplitude at threshold 

• P{*'(m7r), the isospin-even non-spin- flip vrA^ P-wave scattering amplitude at threshold 
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• (Ttvi the pion nucleon sigma-term 



The empirical results can be compared to the xPT predictions evaluated at physical con- 
ditions, i.e. at uIt^ = m]^^'^. 



6.3.1 Monitored Quantities and their Chiral Expansions 

T"'"(m7r) is related to the isospin-even vrA^ scattering length via 



T+(m^) =47r 1 + 



iriN 



From the precision study of pionic atoms, the authors of ref. |ELTn2j extract 
a"*" = (—12 zb 2 zb 8) • 10~^m^^, which provides us with an empirical value 



r+ = (-0.12 ± 0.11) GeV"^ 



(6.15) 



(6.16) 



Note that this value is anomalously small. For comparison, the isovector scattering length 
is a" = (895 ± 3 ± 13) • IQ-^ru'l |FJ.Tn2j . In HBxPT, contributions at order and 
read |BKM97| 



j.+ (<2) 



2mi 



o^2 



C2 + C3- 2ci 



Ml! 

8mo 



.(3) ^ 3(g°)^m3 
647r(/0)4 



(6.17) 



P+(m7r) can be expressed as a linear combination of scattering volumes 



P+(m^) = 47r 1 + (4a33 + 2a3i + 2ai3 + «ii) 

V ruN J 



(6.18) 



One finds values for the empirical scattering volumes collected in |TE03j with data from 
|Koc86j . |KP80j and |FM97j . totalhng to 



Pf = (1044 ± 38) GeV" 



(6.19) 



In HBxPT, contributions at order p'^ and read |BKM97| 

2 f m-jr \ {Q^iYm^ "+(3) _ {g^Yf^T, ffj)^2 , 77 
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Last but not least, we employ information about the pion-nucleon sigma-term, see eq. 
(HHJ. Ref. |(;LS9H provides us with the empirical value cjjv = (0.045 ± 0.008) GeV, and 
argues that a significantly larger value for aj^ would lead to implausible consequences: Ei- 
ther the strangeness content of the nucleon would be substantial ((A^| russs \N) ~ 0.4 GeV) 
or the GMOR relations would be accidental and the quark condensate {qq) would have 
to be suppressed. Therefore, we choose here to stick to the value above, rather than 
to the more recent TRIUMF results |PSWA02| . still under debate. The latter found 
Ed = (0.067 ± 0.006) GeV, which according to ref. |(;LS91j relates to the sigma term via 
a correction A = (-0.003 ± 0.001) GeV, such that ajv = -b A = (0.064 ± 0.007) GeV 
comes out considerably larger. A short overview of present information on o"jv can be 
found in ref. |Sai05j . 
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6.3.2 First Numerical Experiments 

The empirical values for T^, and are to be compared with the chiral formulae 
evaluated at the physical pion mass, i.e. at 111^^ = m^^^. 

In a first step, let us assume that for and it is sufficient to look at the chiral 
expansions to order p^. No new parameters are involved at this order. (One might argue 
that contributions of higher orders should be negligible, since we evaluate at the physical 
pion mass, comparatively close to the chiral limit.) Therefore, we can readily evaluate 
the three observables with the parameters of the fits. Table 16.21 shows that T"'"^-^^ and 
p^^-^^ are completely incompatible with the empirical values. The empirical values lie far 
away from the systematic envelope.^ First of all, let us assume that corrections at order 
and higher can be made responsible. Evaluating the contributions of order < 3 using 
the central parameter values of Fit II, we get 

r+ = r+(^2) ^ 2^+(3) ^ T+i>4) L -0.22GeV-i 

= ( 6.4 + 0.86 + -7.4 ) GeV-i 

p+ = p+(<2) + p+(3) + p+{>4) i io44GeV-3 

= ( 922 + -323 + 446 ) GeV-^ 

where T"'"^-^) and P^^~'^^ have been chosen in such a way that the equations remain 

balanced. The magnitudes of r+(-^) and P^^-'^^ obviously come out so large that we 
cannot speak of a perturbative series any longer.^ 

To localize the origin of the discrepancy, we first determine the parameter ranges permitted 
by the empirical constraints without making use of lattice data. We do not apply the 
method here, because the empirical uncertainties may be correlated. We demand that 
each one of the quantities T^^-^\ p+(<3) evaluates to a value within its empirical 

error bound. Again, we eliminate ci by requiring the nucleon mass to hit the physical 
point, i.e. M^^\m^^^^) = M'^'^^. For e^, we permit a generous range between —10 and 
10GeV-^ and we allow values and in the range of eq. (|6.7|) . Table 16.31 shows 
that the permissible values of mo, ci, C2 comply well with our results from the lattice fit. 
Note however, that the constraints from pion nucleon scattering force C3 to take on a value 
large in magnitude, around — 5GeV~^. Other analyses fitting to data from pion nucleon 
scattering find similarly large values |FMOnj . 

We have checked that a tolerably good fit to lattice data is still possible with a value 
of C3 around — 5GeV~^. However, we will see in section [6.41 that an improved fit to a 
wider range of data using a finite volume correction favours a value C3 lower in magnitude, 
around our original input from NN scattering C3 ~ —3 GeV. What makes pion nucleon 



'^Note that these two quantities are very sensitive to and leading to a large systematic envelope. 

^We have checked whether fourth order coupling of "natural size" can account for such large corrections. 
Formulae of T"*" and at order can be found in [FMOOj . making use of T'+(-*) = 47r(l + m^/mo)aQ'^ + 
0{mT,^) and P+f^*) = 4tt{1 + m^/mo)(2a^"^ + af_) + 0(171^^^). For some of the LECs involved we found 
estimates from the literature. Assuming the 9 remaining LECs are scattered around zero following a 
Gaussian distribution, the standard deviation (i.e., the typical size) of these unknown LECs is « 5 GeV^'^. 
I leave it to the judgement of the reader whether this result can be counted as evidence for the breakdown 
of the theory or not. 



6.3 Consistency with Empirical Constraints 



75 



Table 6.3: Parameter values allowed by empirical constraints 
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scattering different? It turns out that the inffuence of the delta resonance can explain our 
observations. 



6.3.3 The Role of the Delta Resonance 
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Figure 6.11: (1) elastic vrA^ scattering, (2a), (2b) contribution from the delta resonance, 
(3) contact interaction vertex. 



An effective theory must encode the effects of excited states in its low energy constants. 
The excitation energy of the A(1232) resonance lies only A = 0.294 GeV ~ 2m^^^ above 
the nucleon mass. It is a spin-3/2 isospin-3/2 state. A pion-nucleon system can take on 
the same quantum numbers, so it can make the transition into a delta excitation. It has 
been known for a long time that this intermediate state plays an important role in the 
ttN scattering process |EW88) . Already within a simple non-relativistic treatment of the 
delta resonance, we can shed light on our issue concerning c^.^ 

Fig. 16.111 (1) depicts a general elastic pion nucleon scattering process. Here F and G are the 
pion isospin indices. Consider the graphs fig l6.11l f2a) and (2b). The double line symbolizes 
a delta resonance. We work in the static limit, discarding all corrections suppressed with 
l/rriQ. In the center of mass system the nucleon momentum is p ~ (rriQ, —q). The in- and 



am in debt to Prof. Wolfram Weise, who pointed this out. 
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out-going pion carries the energy uj = (m^ + q^)^^'^ = (rn^ + q''^y^'^. The non-relativistic 
vriVA-vertex and A-propagator can be written |KGW98l IEW88j 
F 

girNA c, rr h i 01 \ 



Here, the are spin matrices and the T-p are isospin matrices. The index i = 1..3 is a 
space coordinate. The spin tensor S couples the nucleon Pauli spinor and the momentum 
vector q to the spin of the delta. Likewise, the isospin tensor T maps both isospin of the 
pion and nucleon to the isospin of the delta. The intermediate state must carry exactly 
the same quantum numbers as the system of incoming particles.^ From Clebsch- Gordon 
algebra it follows |K(;W98I IEW88] 

-5*1 5"! = - -^eykCTk , TqTI = ^5gf - -^ecFHAH (6.22) 

Since we are working in an SU(2) framework. Ah are simply the Pauli matrices. The 
amplitude of the "direct" process fig. 16.111 (2a) is set up as 

T-ta,,') ^,y. (-|^s„;rc) (^.,;)_„.^ (-It^W'^O** <«-^'' 

with X x' denoting the Pauli spinors of the nucleon in the initial and final 

state, respectively. From the propagator, the diagram acquires an energy denominator 
[{p^ + q^) — ttt-a]^^ = [oj — A]^^. In the crossed process fig. 16.111 (2b), the energy de- 
nominator becomes [—oj — A]^-*^. The amplitudes of diagrams (2a) and (2b) add up to 

-2u;(5ij iecFHAn +AieijkO-kieGFHAH )x 

We focus on the first term in the above sum. It constitutes a non-spin-flip isoscalar 
P-wave.'^ Thus we identify the delta contribution at threshold oj = m-,^ to be 

p+(A) _ 2g^jvA A _ 2gl^^ f ml \ 
1 9ml A^- ml Om^ A V A^ ^ ' 

BxPT encodes the effect of the delta excitation in contact interactions like fig. 16.111 (3^. 

(2) 

Possible candidates for interaction terms with two external pion lines can be found in C^j^, 
^^nN^ etc. The amplitude of contact terms from C^^j^ reads 

^'^^ u(p') q' ■ q u{p) 6gf H 277o^2 ^^P"> W ' P ^1 ' P + 1' ' P' ^ ' P') ^ip) ^gf 



static limit li 

'X ' 



2(C2 + C3) 2_ J^^/ 
if 0)2 ^ (/0)2« 



X Sgf (6.26) 



''The strong interaction conserves spin and isospin. 

^It is proportional to q' ■ q, which makes it a P-wave in the partial wave decomposition, and it leaves 
spin and isospin invariant. 
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where we have used u{p) 



X 

2mo 



The second term in the static hmit, proportional to C3, is again an isoscalar non-spin-flip 
P-wave.® This term must generate the entire isospin-scalar non-spinflip P-wave amplitude 



-(A) 



of the delta. 



Here we come across an explanation for our observed discrepancies. The LEC C3 should 
be determined (if it were possible) in the limit m,r — *■ 0. Using B^PT to order p^^ a 
hypothetical pion nucleon scattering P-wave analysis executed in the chiral limit would 



then measure C3 precisely , while a real world experiment carried out at rn-jr = m 
a modified value £3. Comparing eq. (|6.25j) and (|6.26j) . we find 

A ^ C3 A2 

A2 _ (mf y^)2 ' ^ C3 



phys 



yields 



9mg A 



C3 = - 



9ml 



A2 



[m 



phys X 2 



1.3 



(6.27) 

Of course, the m7r-dependence of Pj^*-^^ can be encoded by BxPT as well, but this would 
require terms from >C^^ . The large number of coupling constants appearing in a full fourth 
order calculation make it difficult to produce a reliable fit to data IFMODj . Moreover, the 
higher order terms would be large. This is due to the fact that A is much smaller than 
A-^, so that the corrections of order m^/A^ in eq. 16.251 do not fit properly into our power 
counting scheme. 



p-q 



1 



(4) 



(5a) 



(5b) 



Figure 6.12: (4) nucleon self-energy contribution from the delta resonance, (5a) (5b) single 
delta excitation in A^A^ scattering. 

It is interesting to check what happens to the contribution associated with C3 in our 
nucleon mass formula eq. (|2.79|) . Here, we must compare to a loop graph involving a delta- 
resonance in the intermediate state, see fig. 16.121 f4). Inserting a relativistic propagator 
for the pion, the amplitude reads 



-gnNA 



(27r)4 V 2mo 



SttNA 

3 ml 



g2 _ _|_ ^QH 



(P° 



(27r)4 (gO + cj - iO+)(gO - w + iO+)(gO + A - iO+) 



d^q 



3ml J (27r)3 2a;(a; + A) 



(6.28) 



^Naturally, we find the contribution -2ci/{f°f also in the HBxPT resuh P+'^2) of eq. iflT^ . 
^Let us assume here that the delta mass shift A is only weakly dependent on m-^, and that diagrams 
16.111 i2a). (2b) constitute the dominant contribution to . 
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In the last line, we have integrated with respect to in order to make the energy denom- 
inators visible. For comparison, the amplitude of the tadpole graph of fig. 12.21 (b) created 
from the C3-term is 



(/0)2 7 (2vr)3 2.. 

The part proportional to must generate the contribution of the delta. However, the u 
dependence of the energy denominator in eq. (|6.28|) is different. Suppose for a moment 
we had obtained an energy denominator [2u;A]~^ in eq. 1)6. 28() . Then we could encode the 
delta loop diagram perfectly in the tadpole graph, by setting the contribution to C3 from 
the delta resonance to^'' 



J A) _ ff^TVA (/tt 



0\2 



9mg A 



-3.3 GeV"^ (6.30) 



The actual energy denominator [2a;(a; + A)]~^ makes higher order corrections necessary. 
Contrary to the case of ttA^ scattering, uj is added to A in the denominator. Consequently, 
the contribution of the delta loop amplitude decreases with rising pion mass. We should 
expect that smaller values of C3 work better for fits with the nucleon mass formula. This 
is indeed what we observe. 

How do the effects of the delta dynamics show up in A^A^ scattering? According to ref. 
|K(tW98] . an interesting coincidence occurs for the isoscalar central and isovector ten- 
sor channel of the graphs IB. 121 (4a) and (4b). Here, the energy denominators of the two 
graphs add up to the simple expression 2[Aa;i C02\~^, where uji and L02 are the energies of 
the exchanged pions. This combined energy denominator looks like a composition of two 
pion propagators and an energy independent delta propagator. Fixing C3 from the ampli- 
tude of these graphs gives the same result in the chiral limit and at the physical point. 
Unfortunately, other channels of A^A^ scattering and contributions from double-delta ex- 
citations do not share this convenient feature. Nevertheless, it adds to the plausibility of 
the A^A^ result C3 = — 3.4GeV~^ from ref. |EMn2j . that the corresponding "biased" value 
C3 = — 4.4GeV~^ is already in the range of results from vrA^ scattering analyses. 

As demonstrated in ref. |HPWn3] , attempts to fit lattice data of the axial vector coupling 
gA_ show even much more dramatically that the theory fails without inclusion of the delta 
fields. From the considerations above, it is easy to understand why there are observables 
which are affected particularly strongly by the delta. As soon as a process of the direct 
type (fig. 16.111 (2a)) is allowed by the symmetries, the corresponding energy denominator 
enhances the influence of the delta as [m.,^ — A]^^. With pion masses close to the delta 
mass, as we find them in lattice data, this factor explodes. In other words, "the delta 
resonance goes in resonance." 

Even at the physical point, the effects of the delta dynamics can be quite significant 
numerically, as we saw in the case of pion nucleon scattering. Encoding them in large 
higher order contributions is problematic. 

The inescapable conclusion is that delta degrees of freedom should always be in- 
cluded explicitly in Chiral Perturbation Theory with baryons. 



^"For the numerical estimate, we use g^NA ~ 28, mo ~ 0.88 GeV. 
"Compare e.g. to £3 = -4.7 ± 1 GeV from |BMOO| 
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For the nucleon mass, the extension to a framework including delta fields has been done, 
and fits to lattice data are possible, see appendix 



6.3.4 Constrained Fit with a Modified C3 

For the present analysis, we use as a working approximation the modified value C3 = 1.3 C3 
as input to the pion nucleon scattering formulae, which we consequently denote T'"'"^-^) and 
p+(<3)^ Firstly, we reevaluate the parameter bounds permitted by empirical constraints. 
The result is shown in table 16.41 Among these parameters, we select the set with the 
smallest with respect to the four lattice points, see column (b) of table 16.41 



Table 6.4: Parameter values allowed by empirical constraints. 





(a) constraints alone 


(b) lattice fit 


mo 


(GeV) 


0.871 . 


0.897 




0.882 


Cl 


(GeV-i) 


-1.19 . 


-0.68 




-0.91 


C2 


(GeV-i) 


2.2 . 


4.0 




2.95 


C3 


(GeV^i) 


-4.2 . 


-3.3 




-3.65 


ef)(lGeV) 


(GeV-3) 


-10 


. 10 


input 


1.88 


Sa 




1.1 . 


1.3 


input 


1.1 


J TT 


(GeV) 


0.0862 . 


0.0924 


input 


0.0924 


r+(<3) 


(GeV^i) 


-0.23 . 


-0.01 


input 


-0.01 


p+(<3) 


(GeV^3) 


1006 . 


1082 


input 


1006 


UN 


(GeV) 


0.037 . 


0.053 


input 


0.050 






0.31 



Note that most input constraints in column (b) of table are active, i.e. the best fit to 
lattice lies on the border of the region we permit. An exception is the sigma-term cjat, which 
comes out beautifully at 0.050 GeV, very close to the value obtained in our previous fit, see 
tableinm There is no reason to be concerned about the fact that the constraints are active. 
The four lattice points are simply far too little input to determine the seven parameters 
entering the nucleon mass formula. Therefore, the constraints provide essential additional 
input, and we see them actively influence the fit result. The drawback of a constrained 
fit of the above kind is that it becomes much more difficult to estimate errors, which we 
therefore have not done. C3 now assumes a value quite close to the A^A^ scattering result 
C3 = —3.4 GeV. This is only possible, because thanks to our delta correction, T'^ and 
are evaluated with the modified value C3 = 1.3(— 3.65) GeV = —4.7 GeV. 

The quality of the curve produced with the new parameters is formidable. The dashed 
line in fig. 16.131 passes through all the lattice data points, see fig. 16.131 Moreover, the 
curve lies right in the center of the systematic envelope of our original setup in section ing 

In conclusion, the introduction of strong constraints from vrA^ scattering leaves our inter- 
polation curve practically invariant, if we take effects of the delta resonance into account. 
We thus obtain a consistent picture and have good reasons to be confident about our 
results. 
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Figure 6.13: Fit to lattice data with empirical constraints T~^(--^\ p+(<3) ^^^^ ^or 
comparison, we plot again the systematic envelope of figure 



6.4 Finite Volume Effects 

6.4.1 Basics of the Finite Volume Correction 

Lattice calculations are carried out on a box of rather small side length, which causes 
deviations in the calculated observables. Chiral Effective Field Theory can describe the 
effects of a finite simulation volume on the nucleon mass. This way, more lattice data 
become accessible to xPT. 

We still assume the time dimension to be so large that we can take it as infinite. Based on 
the restriction of pion loop momenta to the discrete values permitted by periodic boundary 
conditions, calculations for the mass difference caused by finite simulation volumes done 
AK"'"04| yield the following contributions at order and 



m 



A(3)m;v = ^J^2^^l^dx Yl Ko^L\n\^mlx^ + mlil-x)^ (6.31) 

A (4) 3m^ ^ / Ki{L\n\m^) K2iL\n\m^) \ 

nez»\{o} 

where Kq and Ki are modified Bessel functions. 

We notice that these formulae introduce no new parameters, and that ci, C2 and C3 appear 
in linear combinations different from those in the infinite volume formula (|2.79j) . Thus 
volume dependence can provide an entirely new kind of constraint on the fit parameters. 



6.4 Finite Volume Effects 
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Originally rated as an undesirable artefact, finite volume effects be exploited to gain 
essential new statistical information. We explore this possibility by fitting lattice data to 
the volume corrected nucleon mass 

mj^^^(m7r, L) = mj^^-'(m^) + A*^'^^mAr(m^, L) + A'^^^mAr(m7r, -L) (6.33) 



6.4.2 Fast Implementation of the Finite Volume Correction 

In the course of fitting and statistical analysis, rn"^^\mT^ , L) must be evaluated numeri- 
cally a large number of times. Therefore it is necessary to have a fast implementation of 
the volume correction formula. First of all, we cut off the sums at = 8 and = 6 in 
formula (|6.3H) and (|6.32|) . respectively. The integral in formula (|6.3H) must be determined 
numerically. We speed up its evaluation with our own problem specific implementation. 

First of all, only the length |n| enters the sum over n. Thus for each integer k = n'^, we 
count the number Nf^ of thee dimensional vectors n with integer coordinates and store 
these coefficients in a look-up table. The integrals in (|6.31|) are split into two parts 

4 = / dx Ko{LVk^Jmlx^ + ml{l - x)) (6.34) 
Rk= dxKQ{Ly/k^^mlx'^ + ml{l - x)) (6.35) 

J Sk 



Ik is evaluated numerically. The cutoff has to be chosen so that the tail of the integral 
Rk is negligible. To estimate Rk, consider the asymptotic expansion 

Ko{z) for z ^ oo (6.36) 

For z > 7r/2 we have Kq{z) < . Furthermore, we approximate 

\ mHx^ -|- m^(l — x) « ra^x if a; > 1 and x ^ — ^ (6.37) 
V mo 

The latter condition is superfluous in our case. If the requirements are met, then 



j'OO 

Rk~ \ dx KQ{LVkmox) 

J Sk 



'Sk 
r>00 



< I dx exp{— LVkmox) = — j= — exp(— L\/fcmoSfc) (6.38) 
Jsk LVkniQ 

Rk must be much smaller than the final integral, which we estimate to be of the order 

POO r^z~ ^ 

ik^ dxj—e'' = ^= (6.39) 

" Jo y 2z V2kLmo 

where we have made use of the asymptotic behavior eq. 1)6. 36() and approximated the 
argument of the Bessel function according to (|6.37|) hy z = L^/kmQX. Demanding Rk < 
10~^/fc and enforcing the validity conditions of our approximations x>l, z>7r/2 gives 

Sk = max\-^ In (l Vfc mo 10-^ /fc) , 1, j (6.40) 
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Now the Ik are evaluated using Simpson integration with 11 samphng points each and 
summed up. 



6.4.3 Numerical Results 



Accepting lattice data for pion masses up to 0.65 GeV, we extend our lattice input data to 
the first 10 points in table IbTTI providing us with two volume groups for which simulations 
in smaller lattices exist. The pion mass m,r entering formula (|6.3.S)) is the mass of the 
pion in the infinite volume. Within the volume groups of table IB. II an infinite volume 
pion mass estimate can be taken from the pion mass in the largest volume, underlined in 
the table. Ref. |OLS05j demonstrates that this procedure is sensible. We also copy the 
error of the pion mass from the largest volume. This is only a working assumption and 
not entirely correct because in our analysis we treat the pion masses of lattice points 
within a volume group as statistically independent quantities. 

A weak point of our preceding study of large volume data was fixing C3 to a value still 
under debate and neglecting its uncertainty. With finite volume data as a new source of 
information, we can now release C3 and determine it from the fit. Also, we accommodate 
our systematic uncertainty about C2 in a range from 3.1 GeV~^ to 3.3GeV~^. This range 
encompasses results from several HB^PT fits at 0{p'^) to experimental vrA^ scattering 
data, see table 4 in |FM00j . 

The results of our calculations are summarized in table 16.11 and figs. 16.141 and 16.171 
Looking at the latter figure, we rate convergence of our finite volume correction acceptable. 
Notably, C3 comes out in a range —4.3.. — 1.4 GeV~^ , i.e. for finite volume data, a value of C3 
low in magnitude and compatible with NN scattering results of ref. |EM02j is preferable. 
As a consequence, convergence on the m.,^ axis in fig. 16. 181 appears to be optimal. 



Table 6.5: parameter values and observables for the finite volume fit 





(a) statistical error 


(b) systematic envelope 




(GeV-3) 


2.40 ± 0.36 


fitted 


0.8 . 


. 3.6 


fitted 


ca 


(GeV-i) 


-2.9 ± 0.6 


fitted 


-4.3 . 


. -1.4 


fitted 


mo 


(GeV) 


0.884 ± 0.006 


fitted 


0.873 . 


. 0.898 


fitted 


ci 


(GeV-i) 


-0.88 ± 0.09 


elim. 


-1.03 . 


. -0.69 


elim. 


Sa 




1.267 


fixed 


1.10 . 


. 1.30 


scanned 


J-K 


(GeV) 


0.0924 


fixed 


0.0862 . 


. 0.0924 


scanned 


C2 


(GeV-i) 


3.2 


fixed 


3.10 . 


. 3.30 


scanned 


xVd.o.f. 




0.75 




0.69 . 


. 0.82 




f.+(<3) 


(GeV-i) 


5.1 ± 2.3 




-2. . 


. 15. 




p+(<3) 


(GeV-3) 


690. ± 160. 




180. . 


1190. 




(Tat 


(GeV) 


0.0476 ± 0.0047 




0.036 . 


. 0.057 






(GeV-3) 


2.54 


fitted 
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Figure 6.14: Fit to finite volume lattice data. All the lattice points shown are used as 
input for the fit. The points drawn as black diamonds are the calculations on the largest 
lattice of each volume group and can practically be treated as infinite volume results. The 
lattice points represented by brown triangles exhibit a sizable finite volume effect. They 
are drawn at pion masses copied from the "large volume" result of their group. The fit 
function, the 68% statistical error band and the systematic envelope are plotted in the 
infinite volume limit. 
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Figure 6.15: CL = 68% 3-parameter confidence region for the finite volume fit. Color 
scheme as in fig. 16.41 The systematic envelope reflects uncertainties in g^, and C2- 
Panes a), b), c) show projections on the different planes of fit parameter pairs. In panes 
d), e), f) the confidence region is translated to depict the statistical relation of C3 to aN, 
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Figure 6.17: L-dependence of the fit function and error bands at the pion masses of the 
two volume groups. The shapes and colors of the lattice points correspond to those of 
fig. 16.141 The horizontal dashed line is the infinite volume limit of the fit function. The 
bent dashed lines represent the 0{p^) function, where ep^(A) has been determined from a 
secondary fit to data. 
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Figure 6.18: Study of convergence: nucleon mass predictions at order , and p^, using 
the parameters determined from the finite volume fit. At order p^, ep^(A) cannot be 
imported from p^ and needs to be determined from a secondary fit to the data. 



Chapter 7 

Chiral Extrapolation of Meson 
Mass Lattice Data 



As long as excitations of multiplet states involving valence strange quarks are not in 
resonance during the process under study, we have the choice to pick either a two-flavor 
SU(2) or three-flavor SU(3) framework for our calculations. The two frameworks are 
related to each other via matching conditions: In going from SU(3) to SU(2), we find 
that couplings and masses in SU(2) contain contributions from the strangeness degrees of 
freedom. The MILC calculations jA+n4al iR+mj offer data on meson and baryon masses 



in fully dynamical three flavor lattice QCD. The quark masses employed are again isospin 
symmetric, m„ = rrid = m, and the strange quark mass anis is placed at a value that 
reproduces roughly the physical strange quark mass m^^^^ . 

For MILC data, performed with staggered Fermions, the lattice renormalized quark masses 
directly enter as simulation parameters. This way, they are known to us precisely. One 
can employ them as input in a simultaneous fit to pion and kaon quark mass expansions 
from xPT. 



The work done in ref. A"'"f)4b are meson and quark mass fits of great complexity. The au- 
thors apply Staggered Chiral Perturbation Theory to describe accurately the spurious 
states in the mass spectrum resulting from staggered Fermions. They address discretiza- 
tion errors and finite volume effects, and work at chiral order p^. As a drawback, they 
have a large number of fit parameters. It is therefore interesting to compare to a relatively 
simple fit. Note however, that our analysis suffers from possibly large systematic errors in 
its input data. 



7.1 Matching Conditions 



We employ the GMOR pion mass = 2Bm to parametrize the light quark mass rh. 
Therefore, it is vital to understand the link between SU(3) and SU(2) in the meson sector, 
before we address baryon masses. 

With the renormalization invariant quantity ttik = Bnis, we write the SU(3) GMOR 
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relations in the following form: 



2 



2Bm 



m 



K 



mt 



Brh + BrUs 

2 . 4 
—Bm H — Bm^ 

3 3 



J- — 2 I - 2 

1-2^4.2 



(7.1) 



With the definitions 

Jl{\)^2Ll{\)-Ll{\) 



Jl{\)^2Ll{\)-Ll{\) 



(7.2) 



2 



2 

m 



the SU(3) pion and kaon mass to order read |GL85j 



1 +15M±^4(,) + 



— 2 , ~ 2 



2 
3' 



8m| 



(7.3) 



Fi(m2,rn2^) (7.4) 



2 16(m^+m|^) 4^^ + 8m|^ ' 
i + H ^^0^2 -^4 l-^J H ;r^o\2 ^3 l^J 



(7.5) 



We should recover the SU(2) pion mass formula if we expand for m <C nis or, equivalently, 



1 + 



ft: 



0^2 



16(4^)2 JI(A) - ^In 



4m 



K 



9 3A2 



+ 



(47r/0)2 



8(4vr)2 JJ(A) + 16(4^) VI(A) - - f 1 + In ^ 



1 , mi 
+ 2^"# 



+ C(m^,/^^ 



(7.6) 



Let us underline all quantities in the SU(2) framework. They can effectively absorb con- 
tributions from strangeness. The SU(2) GMOR relation simply reads = 2Bm. Chiral 
perturbation theory does not give us matching conditions for S and m individually, since 
the quark masses m are renormalized within a framework outside the scope of xPT. At 
this point, we cling to the concept of ref. |GL85j . choosing m = m. We could not do this, 
if we ever wanted to compare to two-flavor lattice quark mass data, because then SU(2) 
lattice regularization would impose the quark mass concept to be used for m. Comparing 
the term proportional to in eq. dZSl) to the right hand side of GMOR, one finds |(;h85j 



ml 



2Bm 



B-2 



B 
B 



1 + 



m 



K 



16JI(A) 



9(47r) 



4m|, 
3A2 



(7.7) 



In the following, the correction = + 0{m?j^) can be ignored at our working order. 
Comparing eq. (|7.6j) to the SU(2) pion mass expansion eq. (|6.3j) . one reads off |(TL85j 



ii(A) = 4J3^(A) + 8Jl(A) 



36(47r) 



1 + ln 



Am 



K 



3A2 



+ 0(m^,i^^) 



(7.8) 



7.2 Fit Procedure 
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7.2 Fit Procedure 



The renormalization scheme changes for each set of lattice simulation parameters, mainly 
depending on the lattice spacing a. In general, quark mass parameters are not mutually 
comparable for the different simulations. Thanks to the farsighted choice of simulation 
parameters, a large number of MILC calculations has been performed at the same lattice 
spacing. Here, we shall pick a data set from lattice results for a ~ 0.12 fm; the quark 
masses therein belong to the same renormalization scheme. For our low precision goal, 
the tiny residual scheme dependence on the sea quark masses is negligible, compare ref. 

n+niE. 



As input we take a/ri, am, arris, am-,^, and arriK for 5 data points with pion masses from 
0.257 GeV to 0.610 GeV, see table IB. 2l Note that the data available t o us are those of the 
lightest taste of each particle^, as tabularized in refs. |A"'"04al lB"'"01 . The strange quark 



mass on the lattice has been kept fixed at anis = 0.05 in our data set. The conversion of 
a mass mx to physical units is done according to 



mx = , I , — he (7.9) 

(a/ri) ri 



where we take r\ = 0.324(4) fm, the value specified in ref. |A"'"04aj for the lattices with 
a ~ 0.12 fm. The error Ari = 0.004 fm in r\ is systematic, i.e. it does not fluctuate for 
each lattice point. We do not include it in our formulation of x^- Instead, we treat it 
together with the uncertainty in and carry out the analysis for several values within 
the error bounds. Ref. jA"'"04a offers "smoothed" values for ri/a. We prefer to use the 
unsmoothed values, because any statistical fluctuations should be accounted for in our flt 
to chiral perturbation theory. 

The fitted parameters are B, J^{X) and Jl{X). Resubstituting again m.,^ 2Bih and 
niK — > Bms, we have two flt constraints for each lattice point from eq. (|7.4j) and (|7.5|) . 
For each lattice point, the constraints take the following form (in symbolical notation): 



" fi{am,ams,a/ri+5{a/ri);B,J^,Jl) am^ it A(a?TT.7r) 



f2{am,ams,a/ri+6{a/ri); B, J'^, Jl) amx ^ ^{amx) " i^-^^) 

6{a/ri) PS ± A(a/ri) " 



Here am^, a/ri and amx denote the central values of the lattice calculation results, and 
the quantities preceded by A are one-standard-deviation errors. The auxiliary parameter 
6{a/ri) can be eliminated using the formalism of section [4.91 assuming fi and /2 to be 
approximately linear in 6{a/ri). 



90 



Chiral Extrapolation of Meson Mass Lattice Data 




Figure 7.1: Simultaneous fit to MILC data for pion and kaon masses mT^^niK versus light 
lattice quark mass m. The inner bands (here barely visible) represent the statistical error 
for = ft^^^ , the green outer bands also take into account uncertainties about and 
the lattice scale ri. 

7.3 Fit Results 

The solid lines and the error bands in figs. 17.11 and 17.21 show the input data points and 
our fit curves expressed as functions of rh and m^. The steps in the fit curves, visible in 
particular in the kaon curve, are due to the fact that we take from the closest lattice 
point for the plot. Even though arus = 0.05 is constant, varies slightly with a/ri. The 
location of the lattice points has been converted to physical units using the central value 
of ri . The inner error bands correspond to statistical uncertainty for = f^^^^ and ri at 
its central value. The light green outer error bands arise from systematic errors, namely 
from performing the analysis with 5 different values of in the range 1)6. 7() . and with ri 
taking on its minimal and maximal values on the error bounds. In fig. 17.11 we plot 
and m?^ versus rh. The GMOR relations, which predict a linear rise of and m?j^, are 
astonishingly well fulfilled. One can hardly notice the slight deviation of our fit curves 
from linearity. To make these deviations visible, we plot in fig. 17.21 the quantities 

Q.^^, Qk^ (7.11) 
2m m + rus 

For our fit, we obtain a x^/d.o.f. of 82, which corresponds to an 8a discrepancy. Obviously, 



^Many thanks to Dr. Claude Bernard for bringing this to my attention, and for giving advice on 
conversion to physical units. 
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Figure 7.2: Simultaneous fit to MILC data for pion and kaon masses m-,^ versus light lattice 
quark mass rh. Plotting the quantities Qtt, Qk permits better judgement of the quality 
of the fit. The inner bands represent the statistical error for = fw^^^, the green outer 
bands also take into account uncertainties about and the lattice scale ri. 



the input uncertainties are underestimated. This is no surprise. We have neglected sig- 
nificant artefacts from the finite simulation volume, from discretization, and certainly we 
need to distrust our perturbative expansion of order at larger quark masses. As a result, 
the statistical and systematic errors and error bands displayed are definitely not realistic. 
Nevertheless, they reflect the sensitivity of the fit results to the data. When excluding, 
for example, the point of lowest rh, or when modifying the weights of the points, we see 
some slight changes in the fit curves, visible especially in fig. 17.21 However, some general 
features remain invariant, in particular the long arc describing the pion mass deviation 
from GMOR in figs. OandO 



7.4 Extrapolating to Physical Conditions 

Can we locate the physical point E"'"04j 



phys = 0.138 GeV , m^^^' = 0.494 GeV (7.12) 



in our plots? We know that the value amg = 0.05 is only a guess, placing the strange 
quark mass somewhere in the vicinity of its physical value. Once we have extracted B, 
J3 and J4, it is easy to extrapolate to the true strange quark mass. To that end, we need 
to invert equations (|7.4|) and (|7.5j) . An approximation at working order can be formed by 
substituting mj^ — > inj^ — and in the square brackets and loop terms of 
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Table 7.1: Parameter values and calculated quantities for the meson fit to MILC data 
(errors not reliable!) 





(a) statistical error 


(b) systematic envelope 


B 




(GeV) 


4.153 ± 0.026 


fitted 


4.04 . 


. 4.23 


fitted 


J3(A = 1 GeV) X 






-0.0136 ± 0.0043 


fitted 


-0.028 


. 0.019 


fitted 


J4(A = 1 GeV) X 






0.021 ± 0.006 


fitted 


0.011 . 


. 0.039 


fitted 


J TT 




(GeV) 


0.0924 


fixed 


U.UoDZ . 


n 009/1 


scanned 


n 




(fm) 


0.324 


fixed 


0.3200 . 


. 0.3280 


scanned 


xVd.o.f 


82 


75 .. 


136 




^3 






3.882 ± 0.032 




3.75 . 


. 3.92 




B/B 






1.0553 ± 0.0025 




1.050 . 


. 1.074 








(MeV) 


2.229 ± 0.009 




2.192 . 


. 2.267 




phys 

rris 




(MeV) 


61.88 ± 0.19 




61.0 . 


. 63.5 




- phys 




(GeV) 


0.5070 ± 0.0009 




0.5059 . 


. 0.5086 





eqs. (|7.4|) and (|7.5|) . Solving for rnj^ and m^, we obtain 



(4vr/0)2 



8(4vr)2(2Jl(A)m| + JliX)ml + Jl{X)ml) 



2 _ 2m\ - ml 1 
— 7, r 



4m|- - 



3A2 



2Brh = Gi{mi,mii) (7.13) 



(4vr/o 



0\2 



4(4vr)2(J^(A)m^ + Jl(A)m^ - 2J3^(A)m^ - AJl{\)m\) 



+ 



mi — 16m|^ , 4m|, — mi mi . m: 



36 



In- 



4 A2 



3A2 



Bm,^G2{mi,m],) 



(7.14) 



We can assess the quality of the approximation by resubstituting the results into the 
original formulae. In particular, evaluating F2(Gi(m^, m^), G2(m^, m|^)) at m^^^ and 



m 



^^^^ , we should obtain {mF^^^Y again, but in fact, the resulting kaon mass is 1.5 MeV 
smaller than the input. This error is also an indication of the accuracy of the perturbative 
expansion. We would like to have a more accurate inverse, because otherwise visible 
spurious offsets between extrapolated quantities would appear in the plots. We define 
residuals 



phys \ 2 



R,{mlml) 
R2{ml,ml) 



Fi{Gi{ml,m\),G2{ml,m]^)) - ml 
F2{Gi{ml,m\),G2{ml,m\)) - m\ 



(7.15) 



The iteratively corrected expressions 



Gi{ml,m]i) = Gi{ml - i?i(m^, m^), m|- - R2{ml,m\)) 



(7.16) 



perform better by three orders of magnitude for the kaon. They still constitute differen- 
tiable Mathematica expressions of the fit parameters, which can be processed by our error 



7.5 Making the Connection to SU(2) 



93 



analysis tools. ^ From Gi{ (m^^^)'^, (m^^^)'^ ) we calculate ifi^^^^ and m^^^'^ as shown in 
table 17.11 Note that they are lattice regularized. Now we also know where to place the 
markers of the physical point in our plots. Finally, we are able to plot as dashed lines the 
light quark mass dependence of pion and kaon mass for rri^^^^ . 



7.5 Making the Connection to SU(2) 



1.05 



0.95 



0.9 



0.85 



SU(2)pion 

— SU(3) pion (tuned mj 
▲ SU(3) pion lattice data 
physical point (inferred) 




0.1 



0.2 



m 



0.3 

2 



0.4 



0.5 



0.6 



[GeV^ 



Figure 7.3: The SU(2) GMOR correction factor i?^ as inferred from the simultaneous 
SU(3) meson fit. The solid line and error bands are plotted using the SU(2) formula 
()6.3|1 . The inner band represents the statistical error for = f'^^^^ , the green outer bands 
also take into account uncertainties about and the lattice scale ri. The dashed curve 
represents the SU(3) prediction using the tuned strange quark mass. (For the pion, the 
effect of this tuning is tiny.) 

The fit results can be transferred to the SU(2) framework using the matching conditions 
of section ITtI The value of central interest is I3, see section 16.2.1 1 Even though the errors 
in table [7T1 are definitely underestimated, a value of 



(7.17) 



seems to be a stable result. This is in good agreement with the value 2.9 it 2.4 from ref. 
|(;L84| . In fig. O we plot the SU(2) translation of our fit result in the same way as in fig. 
16.21 Mind that the location of the lattice data shown in this plot depend on the results of 



^Numerical evaluations at intermediate steps with simple error propagation are forbidden, because they 
lead to "double counting" of errors. 
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our fit. The curve indicates that in the range of available lattice data, ~ 0.9 m^, i.e. 
the SU(2) Gell-Mann - Oakes - Renner relations need roughly a 10% correction. 



7.6 Comparing to Other Works 



Converting all values to our renormalization scale A = 1 GeV using 



1 . Ao 
6(47r)2 A 



JI(A) = JI(Ao) + 



1 , Ao 
36(47r)2 A 



(7.18) 



we take values for the from |BE(T94j . Adding errors linearly, one obtains J3 = (0.7 ± 
1.1) X 10^ JX = (-0.1 ± 1.1) X 10^. The results presented in |A+04bj are = (0.43 ± 
0.10 ± 0.20) X 10^ and = (0.39 ± 0.20 ± 0.40) x 10^. Using the matching condition 
()7.8() with niK from eq. 1)7. 14() . we find that this corresponds to I3 = 0.8 it 1.1 it 2.6. As 
illustrated in fig. 17. 4[ these values are in good agreement with our results.^ We refrained 
from drawing an error bar for our result. A more detailed analysis would be needed to 
obtain a fair error estimate. We also find agreement with results for and Lg from ref. 
|ABT01j . which yield = (-0.58 ± 0.7) x 10^. 



/4XIO' 
1 

0.5 



-0.5 
-1 



^ this analysis 
^ Bijnens et al. 
A Aubin et al. 



/3XlO^ 



Figure 7.4: The values of J3 and J4 at the renormalization scale A = 1 GeV as calculated 
in the study at hand (without error bar), by Bijnens et al. |BEG94j and Aubin et al. 
A+fUhl. 



A 
A 



'^Our fit results are so close to the origin only for our particular choice of the renormalization scale 
We checked that performing our study with A = 2 GeV and applying eq. H7.18|l to convert back to 
= 1 GeV produces exactly the same results. 



Chapter 8 

Conclusion 



8.1 Summary 



In this work, wc have thoroughly investigated the feasibihty, rehabihty and perspectives 
of lattice extrapolations. Our main focus has rested on the nucleon mass. The following 
conclusions can be drawn: 



• As long as we fix a subset of parameters and constrain the fit curve to run through 
the physical point, we obtain a stable interpolant in the whole assumed range of 
validity of Chiral Perturbation Theory. The error band remains narrow from a pion 
mass of 0.6 GeV down to the chiral limit. 

• The effect of uncertainties about the chiral limit values and is marginal for 
the interpolation curve, but appreciable for the extracted low energy constants. 

• We have confidence that higher order effects are sufficiently suppressed. Unfortu- 
nately we cannot pursue a very stringent analysis at this point. This is due to the 
fact that the set of fourth order couplings subsumed in ei differs in the calculation 
to order and p^. 

• More indications for good convergence come from a numerical analysis replacing 
by the pion mass dependent gA- 

• With more lattice data becoming available over time, ab-initio extrapolations of 

lattice data come into reach. However, our nucleon mass formula alone will not be 
sufficient to make extrapolations purely from lattice data. The problem is inherently 
badly conditioned. Hope lies in simultaneous fits to a series of observables. 

• Using xPT formulae of order for pion nucleon scattering, we observe discrepancies 
with empirical data. The problem has been identified as an effect of the delta 
resonance. Future Chiral Perturbation Theory calculations with baryons 
should take the delta resonance into account explicitly. We have presented 
a well-motivated workaround that can restore integrity in our particular scenario. 

• Using a correction formula, we have been able to include lattice data for finite 
volume side lengths down to L = 1 fm. We have found that the gain in extractable 
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information is substantial, because the finite volume dependence provides an entirely 
new kind of constraint. 

In a second step, we have begun analyzing three-flavor lattice data. Chiral extrapola- 
tions rely on the pion mass to parametrize the quark mass. Using SU(3) pion and kaon 
mass formulae at chiral order simultaneously, we have fitted lattice data from MILC. 
Matching conditions have enabled us to infer SU(2) results. 

• Despite negligence of some sources of systematic errors, the fit formulae can be 
brought in satisfactory agreement with the lattice data. 

• Effects of systematic discrepancies are reflected in a large x^. The calculated errors 
are therefore underestimated. 

• The extracted low energy constants are compatible with results from other groups. 

• When performing simultaneous flts with nucleon mass data from SU(3) and SU(2) 
calculations, a relative factor B_/ B will have to be taken into account matching the 
quark mass parametrizations via the SU(3) and SU(2) pion masses. According to 
our fit, this amounts to a difference of about 6%. 

• We determine the SU(2) coupling constant I3 k, 3.9. 

• Present day lattice extrapolations ignoring corrections to the Gcll-Mann - Oakes - 
Renner relation underestimate the quark masses by roughly 10%. 

8.2 Outlook 

Even today, we still have rather incomplete knowledge of the low-energy constants of 
Chiral Perturbation Theory as extracted from phenomenology. The broad range of values 
for the low energy constants found in the literature stems from many isolated studies on 
small sets of observables. Yet effective field theories, such as Chiral Perturbation Theory, 
display their power in the description of many different phenomena with the same set 
of parameters. Our success with the finite volume fit demonstrates the advantage of a 
global analysis. The great task for the future will be a simultaneous fit, combining 
chiral formulae for many different observables in a single x^, applied to a broad range 
of statistically independent data from experiment and lattice. The combined fit has the 
potential to produce precision results and reliable error information of a new quality. 



Appendix A 

Comparison to Other Frameworks 



The statistical error band allows us to compare our results to calculations performed within 
other theoretical frameworks. 

We have seen that the influence of the delta resonance plays an important role in baryon 
chiral perturbation theory. Massimiliano Procura has calculated and applied nucleon mass 
formulae calculated within manifestly covar iant SSE PMHl]. Fig. EH] offers a ghmpse 
at the results, and a comparison to our systematic envelope. 




/ 



★ 


physical point 




lattice data (input) 




1. data, m^>0.6GeV 




envelope, w/o expl. A 




fit 1, with expl. A 




fit 2, with expl. A 



0.3 0.4 
[GeV^] 



0.5 



0.6 



Figure A.l: Best fit curves based on formulae at order in manifestly covariant SSE, with 
CA = 1.5, 5^ = 1.267, /° = 0.0924 GeV and A = 0.2711 GeV as fixed input parameters 



PMH"*"! . The red fine-dashed curve (fit 1) implements truncation at O{l/Mo). The blue 
long-dashed curve (fit 2) has been performed without truncation and with the decoupling 
of A (1232) only partially fulfilled. For comparison, we plot again the systematic envelope 
of figure 16.31 
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A very different approach to calculate the nucleon mass is the Chiral Quark Soliton 
Model (xQSM) |(;()SSn51 lDia97| . In contrast to xPT, this model should perform best 
at large pion masses. Peter Schweitzer of the Ruhr-Universitat Bochum has generously 
provided us with data material m^^^(m,r) from a recent calculation of the nucleon mass 
within this model |Schj . The results suffer from an offset, i.e. mAr(m7r) = m/^^^ (niTr) + C . 
From a simple fit to the four large volume lattice data points below m-jr = 0.6 GeV, we 
have determined the offset to be C = —0.321 GeV. The resulting curve, plotted next to 
our results, can be seen in fig. IA.2I The xQSM curve runs rather close to ours. The 
fact that it comes so close to the physical point gives a very optimistic outlook for the 
performance of the model. 




physical point 
lattice data (input) 
1. data, m^>0.6GeV 
XPT fit 

XQSM, offset fitted 
XPT stat.+ sysL error 



0.3 0.4 
[GeV^] 



Figure A. 2: xQSM result for mN^m.,^) as obtained by P. Schweitzer et al. (red dashed 
line) |Sch| KtOSSOB] . A subtraction of 0.321 GeV is needed in order to account for the 
systematic overestimation of the nucleon mass in the soliton approach. For comparison, 
we plot our best-fit curve and the systematic error band of fig. 16.31 
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Table B.l: Selection of two- flavor lattice data taken from |AK"'"n4 and |()LSn5j . 



no. 


collab. 


/3 




a [fm] 


L [fm] 


[GeV] 


mj\f [GeV] 


large L 


19 


CP-PACS 


2.1 


0.1382 


0.111 


2.68 


0.5214(21) 


1.2751(82) 


X 


41 


JLQCD 


5.2 


0.1355 


0.098 


1.96 


0.5453(91) 


1.300(23) 


X 


36 


JLQCD 


5.2 


0.1355 


0.099 


1.58 


0.560(16) 


1.412(62) 




31 


JLQCD 


5.2 


0.1355 


0.099 


1.19 


0.655(32) 


1.637(82) 




8 QCDSF 


5.25 


0.13575 


0.092 


2.21 


0.5570(70) 


1.320(20) 


X 




CP-PACS 


2.2 


0.1368 


0.092 


2.22 


0.5946(53) 


1.348(13) 


X 


56 


OLS05 


5.6 


0.1575 


0.085 


2.04 


0.6429(68) 


1.377(19) 


X 


oo 




5.6 


0.1575 


0.084 


1.34 


0.660(12) 


1.471(29) 




54 


OLSn5 


5.6 


0.1575 


~0.085 


1.19 


0.709(11) 


1.672(38) 




53 


lOLsnsi 


5.6 


0.1575 


~0.085 


1.02 


0.832(22) 


1.900(39) 




4? 


OT,Sn5 


5.5 


0.1596 


0.107 


1.71 


0.6793(71) 


1.410(18) 


X, coarse! 


20 


CP-PACS 


2.1 


0.1374 


0.118 


2.83 


0.7088(25) 


1.4971(77) 


X 


14 


OCDSF 


5.29 


0.1355 


0.09 


2.16 


0.7172(29) 


1.5062(95) 


X 


13 


OCDSF 


5.29 


0.1355 


0.09 


1.44 


0.7316(52) 


1.577(19) 




12 


OCDSF 


5.29 


0.1355 


0.087 


1.04 


0.826(12) 


1.963(30) 




40 


JLOCD 


5.2 


0.135 


0.108 


2.16 


0.7324(85) 


1.509(18) 


X 


35 


TLOCD 


5.2 


0.135 


0.108 


1.73 


0.7300(90) 


1.510(26) 




30 


JLOCD 


5.2 


0.135 


0.112 


1.34 


0.750(17) 


1.618(65) 




24 


CP-PACS 


2.2 


0.1363 


0.095 


2.29 


0.7345(38) 


1.519(11) 


X 


51 


OLS05 


5.6 


0.157 


0.091 


1.46 


0.746(12) 


1.533(29) 


X 


2 


UKQCD 


5.2 


0.135 


0.105 


1.68 


0.760(12) 


1.657(27) 


X 


46 


OLS05 


5.5 


0.159 


0.114 


1.82 


0.7666(64) 


1.509(17) 


X, coarse! 


7 


UKQCD 


5.25 


0.1352 


0.097 


1.56 


0.7791(76) 


1.629(20) 


X 


50 


PLS05 


5.6 


0.1565 


0.095 


1.51 


0.823(12) 


1.637(27) 


X 


21 


CP-PACS 


2.1 


0.1367 


0.123 


2.95 


0.8304(33) 


1.6434(80) 


X 


39 


JLQCD 


5.2 


0.1346 


0.115 


2.31 


0.8388(98) 


1.650(20) 


X 


25 


CP-PACS 


2.2 


0.1358 


0.099 


2.37 


0.8389(41) 


1.652(12) 


X 


11 


QCDSF 


5.29 


0.135 


0.096 


1.53 


0.8681(64) 


1.721(15) 


X 


4& 


OLS05_ 


5.5 


0.158 


0.124 


1.99 


0.8795(82) 


1.631(31) 


X, coarse! 


49 


jOLS0,5| 


5.6 


0.156 


0.098 


1.57 


0.9002(70) 


1.719(16) 


X 


38 


JLQCD 


5.2 


0.1343 


0.121 


2.41 


0.9037(65) 


1.737(13) 


X 


6 


QCDSF 


5.25 


0.1346 


0.106 


1.69 


0.9207(51) 


1.768(14) 


X 


1 


QCDSF 


5.2 


0.1342 


0.123 


1.96 


0.9398(74) 


1.781(15) 


X 


9 UKQCD 


5.26 


0.1345 


0.106 


1.7 


0.946(12) 


1.878(28) 


X 


22 


CP-PACS 


2.1 


0.1357 


0.13 


3.12 


0.9556(41) 


1.7980(85) 


X 


37 


JLQCD 


5.2 


0.134 


0.127 


2.53 


0.9598(74) 


1.801(15) 


X 


26 


CP-PACS 


2.2 


0.1351 


0.102 


2.44 


0.9694(45) 


1.809(17) 


X 
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Table B.2: Selection of three-flavor meson lattice data taken from jB^Oll lA^04a| . 



no. 


am 


amg 




amx 


a/rl 


m^ (GeV) 


m-j^L 


13 


0.005 


0.05 


0.15938(16) 


0.36523(27) 


0.3782(16) 


0.2567(34) 


3.8 


12 


0.007 


0.05 


0.18881(19) 


0.37268(25) 


0.3783(13) 


0.3040(40) 


3.8 


11 


0.01 


0.05 


0.22421(12) 


0.38304(20) 


0.3814(14) 


0.3580(47) 


6.3 


8 


0.02 


0.05 


0.31125(16) 


0.40984(21) 


0.3775(12) 


0.5022(65) 


6.2 


7 


0.03 


0.05 


0.37787(18) 


0.43613(19) 


0.3775(12) 


0.6096(78) 


7.6 
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